♦ 
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Figure 1 



cgagagacgacagaaggttacggctgcgagaagacgacagaagggtccagaaaaa 
ggaaagtgctggaggggagtggggacaaaagcagcgaccaagtgaatgtcacttc 
agtgactgaggccaggcaaaacgcgcgggaaggattttgtgtagcttgggaccctt 
tcatagacactgatgacacgtttacgcaaaatagaaatttgaggagaaacgcctgg 
gccttcggaaaggagtgattgattagtacttgcaagtttaggtgactttaaggagaa 
ctaactaatgtatactattgagggaggaggaagagcattacagagtttcc:agcagc 
agcaggaaagctttggttaatttggaaatggatgatagcattaaaataacagaagc 
gcctccaggtctctgaagcttcagtcccccagctgaaagccagaaaagactaagcc 
cactaagccttttgatccctttggaagcaaagaactttccttccctggggtgaagac 
tctcctcagaagatttcctgtctctgcctatgttacaagaggaatcaaaaccaagac 
agaagagctcaggatgcaggtgagaggcagggaagtcagcggcttgttgatctccc 
tcactgctgtctgcctggtggtcacccctgggagcagggcctgtcctcgccgctgtg 
h cctgctatgtgcccacagaggtgcactgtacatttcggtacctgacctccatcccag 
j atggcatcccggccaatgtggaacgaataaatttaggatataacagccttactagat 
=0 tgacagaaaacgactttgatggcctgagcaaactggagttactcatgctgcacagt 
13 aatggcattcacagagtcagtgacaagaccttctcgggcttgcagtccttgcaggtc 
in ttaaaaatgagctataacaaagtccaaatcattcggaaggatactttctacggactc 
!* gggagcttggtccggttgcacctggatcacaacaacattgaattcatcaaccctgag 
! | gccttttatggacttacctcgctccgcttggtacatttagaaggaaaccggctcaca 
' y aagctccatccagacacatttgtctcattaagctatctccagatatttaaaacctctt 
u tcattaagtacctgttcttgtctgataacttcctgacctccctcccaaaagaaatggt 
\i ctcctacatgccaaacctagaaagcctgtatttgcatggaaacccatggacctgtga 
u ctgccatttaaagtggttgtctgagtggatgcagggaaacccagatataataaaat 
i=j gcaagaaagacagaagctcttccagtcctcagcaatgtcccctttgcatgaacccca 
s ggatctctaaaggcagaccctttgctatggtaccatctggagctttcctatgtacaa 
i* agccaaccattgatccatcactgaagtcaaagagcctggttactcaggaggacaat 
ggatctgcctccacctcacctcaagatttcatagaaccctttggctccttgtctttga 
acatgacananntntctggaaataaggccgacatggtctgtagtatccaaaagcca 
tcaaggacatcaccaactgcattcactgaagaaaatgactacatcatgctaaatgc 
gtcattttccacaaatcttgtgtgcagtgtagattataatcacatccagccagtgtg 
gcaacttctggctttatacagtgactctcctctgatactagaaaggaagccccagct 
taccgagactccttcactgtcttctagatataaacaggtggctcttaggcctgaaga 
catttttaccagcatagaggctgatgtcagagcagaccctttttggttccaacaaga 
aaaaattgtcttgcagctgaacagaactgccaccacacttagcacattacagatcca 
gttttccactgatgctcaaatcgctttaccaagggcggagatgagagcggagagac 
tcaaatggaccatgatcctgatgatgaacaatcccaaactggaacgcactgtcctgg 
ttggcggcactattgccctgagctgtccaggcaaaggcgacccttcacctcacttgg 
aatggcttctagctgatgggagtaaagtgagagccccttacgttagcgaggatggg 
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CGAATCCTAATAGACAAAAATGGGAAGTTGGAACTGCAGATGGCTGACAGCTTTGA 

TGCAGGTCTTTACCACTGCATAAGCACCAATGATGCAGATGCGGATGTTCTCACATA 

CAGGATAACTGTGGTAGAGCCCTATGGAGAAAGCACACATGACAGTGGAGTCCAGC 

ACACAGTGGTTACGGGTGAGACGCTCGACCTTCCATGCCTTTCCACGGGTGTTCCAG 

ATGCTTCTATTAGCTGGATTCTTCCAGGGAACACTGTGTTCTCTCAGCCATCAAGAG 

ACAGGCAAATTCTTAACAATGGGACCTTAAGAATATTACAGGTTACGCCAAAAGAT 

CAAGGTCATTACCAATGTGTGGCTGCCAACCCATCAGGGGCCGACTTTTCCAGTTTT 

AAAGTTTCAGTTCAAAAGAAAGGCCAAAGGATGGTTGAGCATGACAGGGAGGCAG 

GTGGATCTGGACTTGGAGAACCCAACTCCAGTGTTTCCCTTAAGCAGCCAGCATCTT 

TGAAACTCTCTGCATCAGCTTTGACAGGGTCAGAGGCTGGAAAACAAGTCTCCGGTG 

TACATAGGAAGAACAAACATAGAGACTTAATACATCGGCGGCGTGGGGATTCCACG 

CTCCGGCGATTCAGGGAGCATAGGAGGCAGCTCCCTCTCTCTGCTGGGAGAATTGAC 

CCGCAACGCTGGGCAGCACTTCTAGAAAAAGCCAAAAAGAATTCTGTGCCAAAAAA 

GCAAGAAAATACCACAGTAAAGCCAGTGCCACTGGCTGTTCCCCTCGTGGAACTCA 

CTGACGAGGAAAAGGATGCCTCTGGCATGATTCCTCCAGATGAAGAATTCATGGTTC 

TGAAAACTAAGGCTTCTGGTGTCCCAGGAAGGTCACCAACTGCTGACTCTGGACCAG 

TAAATCATGGTTTTATGACGAGTATAGCTTCTGGCACAGAAGTCTCAACTCrTGAATC 

CACAAACACTACAATCTGAGCACCTTCCTGATTTCAAATTATTTAGTGTAACAAACG 

GTACAGCTGTGACAAAGAGTATGAACCCATCCATAGCAAGCAAAATAGAAGATACA 

ACCAACCAAAACCCAATCATTATCTTTCCATCAGTAGCTGAAATTCGAGA1TCTGCT 

CAGGCAGGAAGAGCATCTTCCCAAAGTGCACACCCTGTAACAGGGGGAAACATGGC 

TACCTATGGCCATACCAACACATATAGTAGCTTTACCAGCAAAGCCAGTACAGTCTT 

GCAGCCAATAAATCCAACAGAAAGTTATGGACCTCAGATACCTATTACAGGAGTCA 

GCAGACCTAGCAGTAGTGACATCTCTTCTCACACTACTGCAGACCCTAGC1TCTCCA 

GTCACCCTTCAGGTTCACACACCACTGCCTCGTCTTTATTTCACATTCCTAGAAACAA 

CAATACAGGTAACTTCCCCTTGTCCAGGCACTTGGGAAGAGAGAGGACAATTTGGA 

GCAGAGGGAGAGTTAAAAACCCACATAGAACCCCAGTTCTCCGACGGCATAGACAC 

AGGACTGTGAGGCCAGCAATCAAGGGACCTGCTAACAAAAATGTGAGCCAAGTTCC 

AGCCACAGAGTACCCTGGGATGTGCCACACATGTCCTTCCGCAGAGGGGCTCACAG 

TGGCTACTGCAGCACTGTCAGTTCCAAGTTCATCCCACAGTGCCCTCCCCAAAACTA 

ATAATGTTGGGGTCATAGCAGAAGAGTCTACCACTGTGGTCAAGAAACCACTGTTAC 

TATTTAAGGACAAACAAAATGTAGATATTGAGATAATAACAACCACTACAAAATAT 

TCCGGAGGGGAAAGTAACCACGTGATTCCTACGGAAGCAAGCATGACTTCTGCTCC 

AACATCTGTATCCCTGGGGAAATCTCCTGTAGACAATAGTGGTCACCTGAGCATGCC 

TGGGACCATCCAAACTGGGAAAGATTCAGTGGAAACAACACCACTTCCCAGCCCCC 

TCAGCACACCCTCAATACCAACAAGCACAAAATTCTCAAAGAGGAAAACTCCCTTG 

CACCAGATCTTTGTAAATAACCAGAAGAAGGAGGGGATGTTAAAGAATCCATATCA 

ATTCGGTTTACAAAAGAACCCAGCCGCAAAGCTTCCCAAAATAGCTCCTCTTTTACC 

CACAGGTCAGAGTTCCCCCTCAGATTCTACAACTCTCTTGACAAGTCCGCCACCAGC 

TCTGTCTACAACAATGGCTGCCACTCAGAACAAGGGCACTGAAGTAGTATCAGGTG 
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CCAGAAGTCTCTCAGCAGGGAAGAAGCAGCCCTTCACCAACTCCTCTCCAGTGCTTC 

CTAGCACCATAAGCAAGAGATCTAATACATTAAACTTCTTGTCAACGGAAACCCCCA 

CAGTGACAAGTCCTACTGCTACTGCATCTGTCATTATGTCTGAAACCCAACGAACAA 

GATCCAAAGAAGCAAAAGACCAAATAAAGGGGCCTCGGAAGAACAGAAACAACGC 

AAACACCACCCCCAGGCAGGTTTCTGGCTATAGTGCATACTCAGCTCTAACAACAGC 

TGATACCCCCTTGGCTTTCAGTCATTCCCCACGACAAGATGATGGTGGAAATGTAAG 

TGCAGTTGCTTATCACTCAACAACCTCTCTTCTGGCCATAACTGAACTGTTTGAGAA 

GTACACCCAGACTTTGGGAAATACAACAGCTTTGGAAACAACGTTGTTGAGCAAAT 

CACAGGAGAGTACCACAGTGAAAAGAGCCTCAGACACACCACCACCACTCCTCAGC 

AGTGGGGCGCCCCCAGTGCCCACTCCTTCCCCACCTCCTTTTACTAAGGGTGTGGTT 

ACAGACAGCAAAGTCACATCAGCTTTCCAGATGACGTCAAATAGAGTGGTCACCAT 

ATATGAATCTTCAAGGCACAATACAGATCTGCAGCAACCCTCAGCAGAGGCTAGCC 

CCAATCCTGAGATCATAACTGGAACCACTGACTCTCCCTCTAATCTGTTTCCATCCAC 

TTCTGTGCCAGCACTAAGGGTAGATAAACCACAGAATTCTAAATGGAAGCJCCTCTCC 

CTGGCCAGAACACAAATATCAGCTCAAGTCATACTCCGAAACCATTGAGAAGGGCA 

AAAGGCCAGCAGTAAGCATGTCCCCCCACCTCAGCCTTCCAGAGGCCAGCACTCAT 

GCCTCACACTGGAATACACAGAAGCATGCAGAAAAGAGTGTTTTTGATAAGAAACC 

TGGTCAAAACCCAACTTCCAAACATCTGCCTTACGTCTCTCTACCTAAGACTCTATTG 

AAAAAGCCAAGAATAATTGGAGGAAAGGCTGCAAGCTTTACAGTTCCAGCTAATTC 

AGACGTTTTTCTTCCTTGTGAGGCTGTTGGAGACCCACTGCCCATCATCCACTGGACC 

AGAGTTTCATCAGGANTTGAAATATCCCAAGGGACACAGAAAAGCCGGTTCCACGT 

GCTTCCCAATGGCACCTTGTCCATCCAGAGGGTCAGTATTCAGGACCGTGGACAGTA 

CCTGTGCTCTGCATTTAATCCACTGGGCGTAGACCATTTTCATGTCTCTTTGTCTGTG 

GTTTTTTACCCGGCAAGGATTTTGGACAGACATGTCAAGGAGATCACAGTrCACTTT 

GGAAGTACTGTGGAACTAAAGTGCAGAGTGGAGGGTATGCCGAGGCCTACGGTTTC 

CTGGATACTTGCAAACCAAACGGTGGTCTCAGAAACGGCCAAGGGAAGCAGAAAGG 

TCTGGGTAACACCTGATGGAACATTGATCATCTATAATCTGAGTCTTTATGATCGTG 

GTTTTTACAAGTGTGTGGCCAGCAACCCATCTGGCCAGGATTCACTGTTGCjTTAAGA 

TACAAGTCATCACAGCTCCCCCTGTCATTATAGAGCAAAAGAGGCAAGCCATCGTTG 

GGGTTTTAGGTGGAAGTTTGAAACTGCCCTGCACTGCAAAAGGAACTCCCCAGCCTA 

GTGTTCACTGGGTCCTTTATGATGGGACTGAACTAAAACCATTGCAGTTGACTCATT 

CCAGATTTTTCTTGTATCCAAATGGAACTCTGTATATAAGAAGCATCGCTCCTTCAGT 

GAGGGGCACTTATGAGTGCATTGCCACCAGCTCCTCAGGCTCAGAGAGAAGGGTAG 

TGATTCTTACTGTGGAAGAGGGAGAGACAATCCCCAGGATAGAAACTGCCTCTCAG 

AAATGGACTGAGGTGAATTTGGGTGAGAAATTACTACTGAACTGCTCAGCTACTGG 

GGATCCAAAGCCTAGAATAATCTGGAGGCTGCCATCCAAGGCTGTCATCGACCAGT 

GGCACAGAATGGGCAGCCGAATCCACGTCTACCCAAATGGATCCTTGGTGGTTGGG 

TCAGTGACGGAAAAAGACGCTGGTGACTACTTATGTGTGGCAAGAAACAAAATGGG 

AGATGACCTAGTCCTGATGCATGTCCGCCTGAGATTGACACCTGCCAAAATTGAACA 

GAAGCAGTATTTTAAGAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTTGACTGCA 
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AGGCCTCTGGCTCCCCTGTGCCTGAGGTATCCTGGAGTTTGCCTGATGGGACAGTGC 
TCAACAATGTAGCCCAAGCTGATGACAGTGGCTATAGGACCAAGAGGTACACCCTT 
TTCCACAATGGAACCTTGTATTTCAACAACGTTGGGATGGCAGAGGAAGCrAGATTAT 
ATCTGCTCTGCCCAGAACACCTTAGGGAAAGATGAGATGAAAGTCCACCTAACAGT 
TCTAACAGCCATCCCACGGATAAGGCAAAGCTACAAGACCACCATGAGGCTCAGGG 
CTGGAGAAACAGCTGTCCTTGACTGCGAGGTCACTGGGGAACCGAAGCCCAATGTA 
TTTTGGTTGCTGCCTTCCAACAATGTCATTTCATTCTCCAATGACAGGTTCACATTTC 
ATGCCAATAGAACTTTGTCCATCCATAAAGTGAAACCACTTGACTCTGGGGACTATG 
TGTGCGTAGCTCAGAATCCTAGTGGGGATGACACTAAGACATACAAACTGGACATT 
GTCTCTAAACCTCCATTAATCAATGGCCTGTATGCAAACAAGACTGTTATirAAAGCC 
ACAGCCATTCGGCACTCCAAAAAATACTTTGACTGCAGAGCAGATGGGAi:CCCATCT 
TCCCAGGTCACGTGGATTATGCCAGGCAATATTTTCCTCCCAGCTCCATACTTTGGA 
AGCAGAGTCACGGTCCATCCAAATGGAACCTTGGAGATGAGGAACATCCGGCTTTC 
,« TGACTCTGCGGACTTCACCTGTGTGGTTCGGAGCGAGGGAGGAGAGAGTCiTGTTGGT 
.'o AGTGCAGTTAGAAGTCCTAGAAATGCTGAGAAGACCAACATTCAGAAACCCATTCA 
ip ACGAAAAAGTCATCGCCCAAGCTGGCAAGCCCGTAGCACTGAACTGCTCTGTGGAT 
in GGGAACCCCCCACCTGAAATTACCTGGATCTTACCTGACGGCACACAGTTTGCTAAC 
m AGACCACACAATTCCCCGTATCTGATGGCAGGCAATGGCTCTCTCATCCTTTACAAA 
!■* GCAACTCGGAACAAGTCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGCTA 
! U C ATCGAGA AACTCATCCTGTTAGAGATTGGGC AGA AGCC AGTC ATTCTGAC ATACGA 
'0 ACCAGGGATGGTGAAGAGCGTCAGTGGGGAACCGTTATCACTGCATTGTCrTGTCTG 
! w ATGGGATCCCCAAGCCAAATGTCAAGTGGACTACACCGGGTGGCCATGTAATCGAC 
H AGGCCTCAAGTGGATGGAAAATACATACTGCATGAAAATGGCACGCTGGTCATCAA 
% f AGCAACAACAGCTCACGACCAAGGAAATTATATCTGTAGGGCTCAAAACAGTGTTG 
GCCAGGCAGTTATTAGCGTGTCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAA 
£ ACTACCTACCCAGGAACATGCTCAGGAGGACAGGGGAAGCCATGCAGCTCCACTGT 
U GTGGCCTTGGGAATCCCCAAGCCAAAAGTCACCTGGGAGACGCCAAGACACTCCCT 
GCTCTCAAAAGCAACAGCAAGAAAACCCCATAGAAGTGAGATGCTTCACCCACAAG 
GTACGCTGGTCATTCAGAATCTCCAAACCTCGGATTCCGGAGTCTATAAGTGCAGAG 
CTCAGAACCTACTTGGGACTGATTACGCAACAACTTACATCCAGGTACTCTGACAGG 
AAGGGGGAGACTAAAATTCAACAGAAGTCCACATCCACAGGGTTTATTTITTGGAA 
GAAGTTTAATCAAAGGCAGCCATAGGCATGTAAATGAGTCTGAATACATITACAGT 
ATTAAATTTACAATGGACATGCGATGAGACTTGTAAATGAAAGCATTGTGAACTGA 
AACCGAGTCTCTGTGGATCTCAAAGCAAACTCTTAACTTAAGGCACTTTGATTTTGC 
CAACAAATAATAACAAACATTAAGAGAAAAAAATGATCCACTACGAAATAACAAAC 
GGCTAATGCACCTGAATTCTCAGTAAAAAGACCTTTCTCTCGCTAACAGTl^GCCAGC 
TGCCTCGTGTCTGTTTCCTACCAATGTCACAAACATCGCACACAGGGTGAATGGAGT 
CAACGGGAAAGATTAAGTTTGCGGTCTGTGTAAATCTCAATGTACAAATATTCTGTC 
NCTGGTTTATAAACATTTTGATAAAACCGAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAA (SEQ ID NO: 1) 
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Figure 2 




(3266) 



EcoRI (5579) 
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Figure 3 

MQVRGREVSGLLISLTAVCLVVTPGSRACPRRCACYVPTEVHCTFRYLTSIPDGIPANVE 

RINLGYNSLTRLTENDFDGLSKLELLMLHSNGIHRVSDKTFSGLQSLQVLKMSYNKVQn 

RKDTFYGLGS LVRLHLDHNNIEFINPEAFYGLTS LRL VHLEGNRLTKLHPDTFVS LS YLQI 

FKTSFIKYLFI^D>M.TSLPKEMVSYMPNLESLYLHGNPWTCDCHLKWLSEWMQGNPDI 

IKCKKDRSSSSPQQCPLCMNPRISKGRPFAMVPSGAFLCTKPTIDPSLKSKSLVTQEDNGS 

ASTSPQDFffiPFGSLSLNMTXXSGNKADMVCSIQKPSRTSPTAFTEENDYIMLNASFSTNL 

VCSVDYNHIQPVWQLLALYSDSPLILERKPQLTCTPSLSSRYKQVALRPEDIFrSIEADVR 

ADPFWQQEKrVLQLNRTATTLSTLQIQFSTDAQIALPRAEMRAERLKWTMILMMNNPK 

LERTVLVGGTIALSCPGKGDPSPHLEWLLADGSKVRAPYVSEDGRILIDKNGECLELQMA 

DSFDAGLYHCISTNDADADVLTYRITVVEPYGESTHDSGVQHTVVTGETLDLPCLSTGV 

PDASISWBLPGNTVFSQPSRDRQILNNGTLRILQVTPKDQGHYQCVAANPSGADFSSFKV 

SVQKKGQRMVEHDREAGGSGLGEPNSSVSLKQPASLKLSASALTGSEAGKQVSGVHRK 

NKHRDLIHRRRGDSTLRRFREHRRQLPLSARRIDPQRWAALLEKAKKNSVPJCKQENTTV 

KPVPLAVPLVELTDEEKDASGMIPPDEEFMVLKTKASGVPGRSPTADSGPVNHGFMTSI 

ASGTEVSTVNPQTLQSEHIJDFKLFSVTNGTAVTKSMNPSIASKIEDTTNQNP'IIIFPSVAE 

IRDSAQAGRASSQSAHPVTGGNMATYGHTNTYSSFTSKASTVLQPINPTESYGPQIPITGV 

SRPSSSDISSHTTADPSFSSHPSGSHTTASSIJFHIPRNNNTGNFPI^RHLGRERTIWSRGRV 

KNPHRTPVLRRHRHRTVRPAIKGPANKNVSQVPATEYPGMCHTCPSAEGLTVATAALS 

VPSSSHSALPKTNNVGVIAEESTTVVKKPLLLFKDKQNVDffinTTTTKYSGGESNHVlPTE 

ASMTSAPTSVSLGKSPVDNSGHLSMPGTIQTGKDSVETTPLPSPLSTPSIPTST1SFSKRKTP 

LHQIFVNNQKKEGMLKNPYQFGLQKNPAAKLPKMPLIJTGQSSPSDSTTLLTSPPPALST 

TMAATQNKGTEVVSGARSLSAGKKQPFTNSSPVLPSTISKRSNTLNFLSTETPrVTSPTAT 

ASVMSETQRTRSKEAKDQIKGPRKNRNNANTTPRQVSGYSAYSALTTADTPLAFSHSP 

RQDDGGNVSAVAYHSTTSLLAITELFEKYTQTLGNTTALETTLLSKSQESTTVKRASDTP 

PPLLSSGAPPVPTPSPPPFTKGWTDSKVTSAFQMTSNRVVTIYESSRHNTDLQQPSAEAS 

PNPEffTGTTDSPSNIJTSTSWAIJlVDKPQNSKWKPSPWPEHKYQLKSYSETIEKGKRPA 

VSMSPHI^LPEASTHASHWNTQKHAEKSVFDKKPGQNPTSKHLPYVSLPKTIXKKPRnG 

GKAASFTVPANSDVFLPCEAVGDPLPIIHWTRVSSGXEISQGTQKSRFHVLPNGTLSIQRV 

S IQDRGQ YLCS AFNPLG VDHFH VSLS V VFYPARILDRH VKEITVHFGSTVELKCRVEGMP 

RPTVSWILANQTVVSETAKGSRKVWVTPDGTLnYNLSLYDRGFYKCVASNPSGQDSLL 

VKIQVITAPPVIlEQKRQAiVGVLGGSLKLPCTAKGTPQPSVHWVLYDGTELKPLQLTHS 

RFFLYPNGTLYIRSIAPSVRGTYECIATSSSGSERRVVELTVEEGETffRIETASQKWTEVN 

LGEKLLLNCSATGDPKPRIIWRLPSKAVIDQWHRMGSRfflVYPNGSLVVGSVTEKDAGD 

YLCVARNKMGDDLVLMHVRLRLTPAKIEQKQYFKKQVLHGKDFQVDCKASGSPVPEV 

SWSLPDGTVLNNVAQADDSGYRTKRYTLFHNGTLYFNNVGMAEEGDYICSAQNTLGK 

DEMKVHLTVLTAIPRIRQSYKTTMRLRAGETAVLDCEVTGEPKPNVFWLLPSNNVISFS 

ISroRFTFHANRTLSIHKVKPLDSGDYVCVAQNPSGDDTKTYKLDIVSKPPLINCiLYANKT 

VIKATAIRHSKKYFDCRADGIPSSQVTWIMPGMFLPAPYFGSRVTVHPNGTLEMRNIRLS 
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DSADFTCVVRSEGGESVLVVQLEVLEMLRRPTFRNPFNEKVIAQAGKPVALNCSVDGNP 

PPEITWILPDGTQFANfRPHNSPYLMAGNGSLrLYKATRNKSGKYRCAARNKVGYIEKLIL 

LEIGQKPVILTYEPGMVKSVSGEPLSLHCVSDGIPKPNVKWTTPGGHVIDRP(^VDGKYIL 

HENGTLVIKATTAHDQGNYICRAQNSVGQAVISVSVMVVAYPPRIINYLPRhfMLRRTGE 

AMQLHCVALGIPKPKVTWETPRHSLLSKATARKPHRSEMLIiPQGTLVIQNLQTSDSGVY 

KCRAQNLLGTDYATTYIQVL (SEQ ID NO: 2) 
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Figure 5 
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Figure 7 

Nncl 

1tIXX3AACCAACC(^G^TGC^ 

TCCAATGQAATACTACTCAACTTATTAAAAACAACGACITTCATAAAA 
0>W.TQOTCT(BIAGQ\TCT1XjAQTGAGOTAACCCAATCACAA^ 
ACTCACTQATAAQTOGCTATTTXSTCTATCKl^ 
TTrrTOTOrGTATAATATTAAOAKKSAAATTTaOC^ 
TOTTTCAOTGQAmCXXKMAGAAAATAATAAGTOAACTGTAGCLU 

-I Noc 390 

AOOTOAO^AAOTATCXTOACACATT qG^^ 
OGATTCATATrAACTCCAQTTCCCCATOTqCAGCrCTO 
AACTTCACTACKriTCCTACACA(UGCAOTTACTATCCITCy:ACT^ 
CC^rrAAOCroCACTGAAAC ATGAOTC^ACmm-AATAATCTT 

TAATTTTAT<K?TAOAAATX}TCAACAAGCTTOT(^AQGCTQflAAGATA^ 

(MTCKKXTITTC^GriTroAAAOTAATTCAGTATGTQ 

ATn^CCCCCCAAGTAG^ 

TCTTCTAAAAAQACTITQGTATGCATAGGATCTAAATOTAA ATGTQATTTACTGACATAAT 

^^^^^^^^^^^^^^ ^ A ^^^ A ^^^^ q ^^^^^^ ^^^ r 

AOTAQOOAT(XnXXX3AAAATAOOTACTCT^ 
GOAQCTOAOGTCTGqAAOGCTCQACCmyyrrAGAAATAACX^ 

(UI^TATTTTX^OGCAA ^ 

^CTOTAATTCTAC^^ 
0ATATGQAATAITACrAATITQTACC<^ 

ATTACCTGAOATGAGTCATTrrKnTCACAAGTCTTTTTAflAAO 

CTOATOAAGTMO<^TATAAAGTOAOAOCAGCATGA^ 

^ATTATMI^ 

QAGTAAATCTTCCATAAATATf^TCTGCATTTATAAAre 
yGOTOQGCCTOAGAOATGGAGCrrAAaAACTTACTOK?^ 
AC^OQGATrAAACAOTCWrrTTAGTCACAT^^ 
MCMCMCGACAACAAACCCATAAAAArTATCACCK^AA^ 

CA^CK^QVACTACAQAGTGAAOTAATa^GAQACTTAAT^ 

T^AATAATTCTATTTMGATACAC^GCAAAQMQTCT^ 

ATAQAQAOAATAMGTGATAGATACGTTTGTriTATCTTCATGT^ 

TOTACACCTCAAACATAGATAACC^CAAAflAGQ AAQAGOATAnflTOCCTCrt^ 
GGAAGAGTACATTCGAAAQTCAGACAC^ATTCyrnTAQATOTA 

CCA^CTOGAAACAOAO<KXX^rn^CrTGrAAA Cm 
A3TAOCAqq0CC*AAAA«X3O^ ^ 

QGACTTTTGGTATAGCATTGGAAATGTAAATOAnTTAAATArrTA ata a a a a A-ira^IT^ 

4 Nbc2817 

AAAAAAAAAAAAAAAAAAAAAGGAAGGTCAGACACXnX^CTTCA 
GCAAACAGAAGG<MAGTCACAAA(XCAGGACAAACCACAGTGAT^ 
TQTTATTXKriXjTTQTTGTTACCACCATCATTAOCATATATTCATTGT^ 
CTATGAC^TGTITrmATTCAAGTATATCACATGCTO^ 
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GCCC^GCCTCCTTTXXXXXXXXEC^ 

CACACACACACACACACACACACACCrrrACCrTCTCCTCKK^ 

CACCCAAQCTTAATCCTTTTCClTaXTOCAATAGTACC^^ 

CCCCCTCXXXXTOTTAGOAGATGGQAGAGQTCACOAAAOGAAAOAATTT^ 

OCCA(KXXXXXXX^CA0A(XX3tXX^^ 

<KiAaOAGCAAACTCOa\OOOQAGOCftX^^ 

CCCATTOCX^aAAGOCOAG<Kn-ATAQCTGCGTX^ 

QAGCO<X7ITXnV^ATXXXOTCGQa 

*Exoal(Nuc3«61) 

GCTGGGGTTCAGGGCGAGACAGAOQGTGGOATGGQCAGAAGOGTCX^GGAAAAGGAAAGT 

ACTTCAG(ftKMOTTO<H?ACAAAA<^<XXUCC^ 

GGCAAAAGGAGCXKXJAAGGATTATATGTAGCCItHXJAOGCTT^ 

TTGTKX^AAGCAAGCAATTTGAGGAGAAACGCCItKXiACX; 

AGTACITGTAAGTrrAGOTGAGTTTCAGAACrAACTAACXTATACTATTGA 

ACK^TrcCAGCACK>G<^GCAGaU^GCAATCAGATA^ 

ATGTATGATACCATTAAAATAACAGAAC<3GCCTCCAGTIXrrCrc 

GTGAAGACTAAGCCrACTAAGCCrrTTGCIX^^ 

TGAAG<XTCTCCTCAGAAGATrrCCT(7rCTCra 

AGAAGACgrCA COTATTQCCAACTCTTT 



4 the 4293 
AO! 



TGTTAAATACAQTTTGAQO<m*AGTOTACGGGAACTCAT^ 
CTTCIXTTrATAACTAACTCTTAAra 
CAlLlimiTlATCTAATAATAflCAAqcreATCrKK^ 

CAAAAATATTCAnTGATOTAACAGTGACAGTGTAGGCATAGAAOTAATCATTAOTAAATC 

TTAATITGGGTTAAACrCATTCATAA^GCTOCA OOTIOQOAGQGATCACTG 

CACGTGCQGGTTAAAGATATTnCTAACAAGAGAAGCAQA^ 

CCATCACTOTtnt^OTAAOCAOAOOCK?TOITrCCAA(K^ 

TCXXTOCAAAGTCAGAOACTCAQCCCTCTCAO^ 

AGTTGGCrCrGAAAAGGCCCATGTQTCCTTATTGGCAAGQACTrGCAQA 

GAAATTTGAOCmTTITCTAGTGQGTTATTACAGCTOT 

CCAAATAAATAAAACACATATTAAATAATACAATGTTACAAAAATroATCATATAAAGAA 

QTACATTCATAAATOCAATGTGAAAAATATATATAAT^^ 

rrTTCTAAATTGCACATCTACTATrrrTATATTTATAAAAATATTm 

AGTGTAAAAQGCrCTTGCTr^AACAAOAQAGTTAAATTTACAAAC^ 

AACATTATTATOATCTXTTAATGA^GGGATa^^ 

AAGATATGTTTACAATAATAAGCCCATTTAGTGATAAAGTCX^ATGGGAAGCTAGCAC/iC 

ACTCGTrTATAAAGAGAACAGTTTCCTOAGTt^ATGCAAGTTTA C^ 

AOTTtXTCTTTCTOCAGATTtt^^ 

CATTATAAOTQQAATAAQATA GQATCTCAAAGQAATOT(^^ 
CTTTCAOQTCriTCTTTCAGTT(^CT 

TAATTTTTTTQCAQTTCTTATOQAATTCCAAAAAACAAAAA^ 

ACCTCTGAAACTAGAACTACCAATOCATTACTQQOTATOrAACAAA GAGA 

GAATTTATTGCTACATTGTTCATrATTCAC^ 

GCCGTCAAAATATCAACGOATAAAGAAAATGTGGAAATGTGTACAAGAQACriY^ 
OCCATAAAAQAGTQAAATCATGACATATGCAGQAAATQGATGCAA CItKiAAAT^ 
OOCTAATCAAAACAAGACAGACTCAAAAAGGAAACACCQTOTAGCTTCrCT^ 
GAAGCTAGATTTACACTIOTACQT<XX]CATGTGTOT^ 

ATTCTAATCrQTGAOTGTGTATAAAGGGATGCATGTAAAGCAA AA ACAAGCT AGCIGGCiG 
TQQGTAGGAQAGAAAGCAATGAGAGGAGTTAATAAQAACQAAGCATAGTAACATAGGirG 
CCAOOATOAA ATCK^TrAATrre rATQ^ 

AACCAGGGTTOAAATCCCAOCAGAGAGAAGGGOAAOTAOACACAA^ 

CAAOAAQCCATTTQCAOTTGCTGCCTGCTQOGAQCX3G^ 

ACmTGTATAACMCCAQTI^ACAATACAAAGTTGGC^TGATQGATG^ 

TTIX^TTTl'lUTnTIiACTGlTrrOTTGTTGTQQTGGTTa^ 

ATTTGTTTQrTTTGAOAGAQAGAAQQAACATGAAATTGGOTOGGTAGGAA 

ATCTGGAAGAAGTTQGGGAAAGAGAAAAATTGTATGGAGC\TATTTAAACAAACAAACA 

AACAAACAAAAGOTrt^TTTTQCCAGAAAAAGGTGTGAATTA AATTAAC^^ 

CTTAAAGAAAATATTOCCAATTATrCCCAGAGTKKn'ATtyrATflCTO 



• # 
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CirQAACTQQCCCTATAACI^^ 

TGTCCTXXJCCGATCTCK^^ 
TCCATCCCAGACttW^TO^^ 



GGACCTTCXXHXlATCnXX^^ 
AAACACCTGTGATTACrrTTAATAUl llUXJriCTAAAATGG<TITCATACAAACCTTATATTG 
TGOAGACAATOAACATTTTATCXX1AATAGTCTTITACT 
GTTIXKK)AQCCTCATAATTATOQGOCA<KTITATrc 
A<^GTTTCTGTTAACAATCATT^^ 
AAAAGCATTTCTTTTGCAATI^ 
CCTTACTAAATCTGACAAATTA^ 
ATAAATGCAC^GAAATGGOATATGCCATCACCT 
ATGCCAAAC^CTATTGTTCACGTTK3GAGGA^ 
TATAAOOTATACXX^OTGCCTlXSaAAGC^ 
CnXXXnATOAOGGAGGGGO^ 
GGCAGGTTTTATOATAAtnTGA^ 

gagaacatgcctgaagaagatcagattataggcaagcct^ 

tgattcatgggg<^gggcxx:agtcc^^ 

aaaaaaaaacaggctgagcaagtgixi^aggagcaa 

TCranGCATCAGCTTO^ 
GAGTGATGAACAATGATGTGGAACTATAAG< 
TTGGTCATGATGTITCATCACAGTGATAAm 
CCTnXKJACTAAATICrGTTATCTATACK^ 
CTACACCAGAGTGTAATCTGGGCCTGAG^ 
GKKKKnTCTTAACCGGAACACAGTTOT 



TGTOGACnTrAATTGATOATrCATCTGAAGCT^^ 
CTCACTAGATTG ACAGAAA4TC ACTTTTCTGGCCTGAGC^ CA 
G<^TG«^TrCACA<^GTCAGltMCAA 



GTGAGATAGQTAGAGGGTGATXKxAGCKJK3AGAAflAGAGGTGCA^ 
AAAAGKnX3CTGATTCmiTGGGAOACATTCT 
TC^GGAAGGATTTCAATTGTAATGi^ 
CTGGGGAAAATGCTTACACAATTGCGAGGACCTAATTrAG^^ 
AAAGCACAGCATGGCGGCAGACACXnXXAATTCCT 
CCCAQAGACK^TTQGCCAA^ 
ACTTOACTCAGAAACTAATGTGGAAAGCATC 
CATGCATGAATAAGGGATCCCAGAGAGAAGGGAAQAAAAAGG^ 
M<3GAAGGAAGGAAGGAAGGAAGGAAGGAAGGMGAGAGGO\GGAAAGGAGGGA 
AAGGAAGGAAAGGGAAAGGAAAAAAGAGATGGGGAGGGAGGGAAGGAAAGGAAAGGG 
GOAOAAAGAAGAGAAGAAAGGAAAATAAATAAATTTr^ 
TTTTATCCATAAAAGGTCATTIXXACCT^ 
AAGGGCAGTCnTTAACATAGTAGCATTTTATA 
TTmATCCTCTACCATCITCAAACTO 
TCTAAGAGTTTTCAiCACACCGAAGTG^ 
AATGGGGAGAACAAAAAGAGACTGCACAGGGAG<XCTTTCT^ 
TTGAGAAGCTAAITGACGCTGCATGACTACT^ 
ATOAAAAGCACAATAAAGTACATATG'ItXX^ATAA 
ATAATAGCAATGAAAGCAATAAC^CCCACTGTTCAC^ 
ACrATGTXJTAGTGTTTTOACrrAGOOT 
ATTTAGOTCATCAGACAGCATTTGGAACATGTATCrTC 
GGATGAAATTCTW 

GGGTTrGAGGTTTT ACIXX)^ Q 

CTTTTATTTATGAGGGATGTTGGTATira 

GGTGAAAACAGAAAAATATGTAAAAAGTTATTGTTTTTA^ 

TTTTAAAAATGGTATGOTAGAACTAA^ 

AGAGAGOGATTTGATGAATGCTAAAG<^^ 

CAGAATCAATTAAATTX^TATTACTATAAAAGAOW^ 

GAGTGGATAAACTOTGTAACXJTGAGTGCTATO^^ ~ 



fNuc6755 



E*afl3(NttcS039)^ 
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TGTQCGCIOCAACATCITGAAAACATTO 

ACATATTCK^TGOTTATGTrTATATQAAATGriTAAAATACATGGATTOT 

GTAAGATGTTACrrAGGGTCAGGAAAAQATTAAA^ 

ATCITAATTTOGGOAAAAOACAATTTCCTAAGACOAAAm 

ATCCCTGTGGATATTGTAATAAACCAGCATOCT<nXrcTCT 

GCAGGAGGAAGTGAAATGAGATGGTAGAAAGGAAAGTCATATAlXATGGCT 

GGTGGAATCTAGATATQTTAATATATTGA(^TAAAOGAAGGAATTGTTTAG<jGAAGCiA^ 

AAAAa^CAOGAGTQAGGGAGACAATAGGAACCAATGAQAGO<^AA 

TGTGTGTGGAGA<^CCATAATAAAA£ri^^ 

TAAAAACAAAACATTTTTG<^CAAGAATTATTTATTATTC 

GAAGlTGAAGTTCATIXjATAGTCrrcATAAATCTrAAATG^ 

mATrATrAAmcrmacTGrc^^ 

ATGCTTTTCACCATA(^CAAATGTAGGCAGAAGAGTGTAATCCACTTA 
CK>AOAG<K>AAAG<3AA0a3<^^ 

ATGAACAAGAAAAAATGTGGACACATGCACAAAAATTCCATAGTCCACTACATTACnTrc 

TATTCTAATATrAA<MAAATAATAAAa^ 

TTATCirOGCCACAGAIXXStncrcACTOCAlXXn^ 

AATCX^TOCAGOOCATGATaCTTACICCTACACAaAACTAA^ 

AAGTAACCATGACATCTCTCTGTTCXnTrAGCG^ 

CfTATGTICTAATlTCICTAATATItj^^ 

TGGAGTTAAGGTCXXATAGGATTAATCrC^CATAmCTATAm 



TTGTGAAAGTTCCTTTGAGAAAAC^AATATGCCCATA^ 
AtflATAACAAAGTCCAAATAATrcAGAA 
TTGCACClXKJATCA^C^Cm^^^ 
WXXXTNOTACATCTACMCCAAACC^^ 
{^GCTATCTXX^GATATTTAAAACCTaT^ 
ACCTCXXTCCCAAAAGAAAT<H7rCT<XTCTAT(X^ 
ACCCATGGA(XT(?TOACTG<X^TTTAAAGTG<JTTGTCCQAQT<JGATG 



ACTATC nUlTlUmWn C i rm 
TATCCCAAAAGTCCCCCATAACCTCCCCC^ 
CCCTGGCATTCCCCTGTACTGGGGCATATA^ 
AGTGATGGCCAACTAGGCCATCTTTTGATACATATGCAGCT^ 
TACTGGTTAGTICATAAIX}riX3TTGCACCTACA0GOTTGAA 



E»n4?(NBcll286)t 



Nik 11680 




(SEQIDNO:3) 
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Figure 8 



Exon/Intron 


Exon 


Exon 


Exon 


Intron 


No. 


start 


end 


length 


length 


1 


1 


542 


542 


2356 


2 


2898 


3094 


197 


1334 


3 


4428 


4557 


130 


3068 


4 


7625 


8019 


394 





Figure 9 



1 3661 4203 6559 6755 8089 1218 11286 11680 

(-3661) (f) (542) (2898) (3094) (4428) (4557) (7625) (8019) 
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Figure 10 





g^nomicjiu 


1 
1 


ogagagacgacagaaggttacggctgcgagaagacgacagaagggtccag 




cDKX_rat 
g*nomio_Ha 


51 
1 


aaaaaggaaagtgctggaggggagtggggacaaaagoagcgaeeaagtga 






101 
1 


atgtcacttcagtgactgaggccaggcaaaaogogcgggaaggattttgt 




gsAOttic_htl 


151 
1 


gtagcttgggaocctttcatagacactgatgacacgtttacgoaaaatag 




eDNA_rat 
ganoaiojbu 


201 
1 


aaatttgaggagaaacgcctgggccttcggaaaggagtgattgattagta 


□ 
i 


eDNA_rat 
genon±c_Hu 


251 
1 


cttgcaagtttaggtgactttaaggagaactaactaatgtatactattga 


;1 


cDNA_rat 
g&aoinJ.e_hu 


301 
1 


gggaggaggaagagcattacagagtttccagcagcagcaggaaagctttg 


? ^ 


geno£j.c_hu 


351 
1 


gttaatttggaaatggatgatageattaaaataacagaagegectccagg 


%Q 




401 
1 


tctotgaagottoagtcccooagotgaaagocagaaaagaotaagccoac 


; *«; 


gefioaxc hu 


451 
1 


taagacttttgatecctttggaagcaaagaactttcettecetggggtgii 




ganoaicjhu 


501 
1 


agactctcctcagaagatttcctgtctctgcctatgttacaagaggaatt; 


id; 


genomlc_hu 


551 
1 


aaaaccaagacagaagagctcaggatgcaggtgagaggcagggaagtcag 




genomic_hu 


601 
1 


oggottgttgatotoootcaotgotgtctgootggtggtoaooootgggn 




genomic_hu 


651 

1 


gcagggcctgtcctcgccgctgtgcctgctatgtgcccacagaggtgca<; 




cDMA_jeat 
g@nomia_hu 


701 
1 


tgtacatttcggtacctgacctccatcccagatggcatcccggccaatgt: 




ganoa±c_hu 


751 
1 


ggaacgaataaatttaggatataacagccttactagattgacagaaaacg 




genosii. o hu 


601 
1 


actttgatggcctgagcaaactggagttactcatgctgcaoagtaatggc: 




cDIW^rat 
g<anoaio_hu 


851 
1 


attcacagagteagtgacaagaecttetcgggettgeagtcettgcaggt: 






901 
1 


ottaaaaatgagotataacaaagtccaaatoattoggaaggatactttct: 
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genovl c_bu 

genoniojtra 
cDN\_rat 

dDM&_sa<b 
gano&i.c_Jhii 

genotiiojni 

cDNA_rat 
gases! c_Hu 

cDH*_rat 

CDNAjpet 
g©aoiic_tm 

dDM&jeat 
genottiojfcm 

cDKA_rat 
gencmic^Jhu 

dDNAjtat 
bu 

genoaiq_lra 
g&noa±o_hu 

genomlo_hu 

cDUArat 
gono8tio_hu 

cDK\_rat 
geno&±e_foi 

eSKAjrat 

<IDHA — rat 
ganomiejm 

<*BMA_rat 
genoaicjiu 

cIBtfcrat 
geaoaic_hu 

cDNft^rat 
ge&oale_hu 



95 
100 
105 
110 
115 
120 
125 
130 
135 
140 
145 
150 
155 
160 
165 
170 
175 
180 
195 
190 
195: 



acggactegggagcttggtceggttgcacctggatcacaacaacattgaa 

ttcatcaaccctgaggccttttAtggacttacctcgctccgcttggtaca 

tttagaaggaaaccggctcawaagctccatccagacacatttgtctgat 

taagctatctccagatatttaaaacctctttcattaagtacctgttcttg< 

tctgataacttcctgacctccctccoaaaagaaatggtctcotaoatgccJ 

aaacotagaaagcctgtatttgcatggaaacccatggacctgtgactgco 

atttaaagtggttgtotgagtggatgcagggaaacooagatataataaaei 

tgcaagaaagacagaagotcttooagtootcagoaatgtcocctttgcal: . 

gaaccccaggatctctaaaggcagaccctttgctatggtaooatotggag 

ctttcctatgtacaaagccaaccattgatccatcactgaagtoaaagago 

ctggttactcaggaggacaatggatctgceteeaeeteaeetcaagattt 

catagaaoootttggotocttgtctttgaacatgacanauntntctgga«i 

ataaggccgacatggtctgtagtatocaaaagccatcaaggacatcacca 

ectgcattcactgaagaaaatgactacatcatgotaaatgcgtcattttc 

cacaaatcttgtgtgcagtgtagattataatcacatccagccagtgtggc 

aaottctggctttatacagtgactctcctctgatactagaaaggaagee<: 

cagcttaccgagactccttcactgtcttctagatataaaoaggtggotol: 

taggoctgaagacatttttaccagcatagaggctgatgtcagagcagacc 

ctttttggttccaacaagaaaaaattgtcttgcagotgaacagaactgcc 

accacacttagoacattaoagatooagttttooactgatgctcaaatcgc 

tttaccaagggcggagatgagagcggagagactcaaatggaooatgatoc 



* 
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in- 

y 



<=DNA_rat 2001 tgatgatgaacaatccoaaaotggaacgoaotgtoctggttggcggcact 

{jenosia mi 1 —————— _ 

ganoMic'hu att9Ccat5a ' ct ' toca 39 ciaaa Wo»aoccttcacctcacttggaatg 

g^^^hu 21 °1 ?f tt ^ ta9 ° t9atM * a ' taaa « t Sf a 8 a « ce ccttacgttagoga«jg«tg 

JjSEikfjm 215 J Wcgaateetaatagacaaaaatgggaagttggaactgcagatggctgac 

gwmdTta 22 °1 !?!^^!^ oa?9totttaccact sr cataa «« a ecaatgatgcagatgc- 

SUlS**! 225 l ??!!^!°*f a f a ^ a ff« a 2 aact ff t ^ a 9 a 9 ooota -tflff a S a a«!Jca 

g«!oaie k l»i ! a f"^ a ^9 <, ' tcca 9 <J * c * oa 9t9gttacgggtgagacgct:cgao 

235 i f^^*f^ ttt ^ < ««0^ t *cc.g«tgottot«tt^otgg*ttct 

eDHA_r&t 2401 tccagggaacactgtgttctctcagooatcaagagaeaggcaaattctta 

flwotie^au l 

24S J f^^^^^^^^acaggttacgceaaaagatoaagsrtcat 

S££c\u 2S °i ^ff^^^f^^f^ff^toaggggccgacttttocagttttaa 

S££\u 2M i !f!!!!!f!!! aaaa f aaa ?^« a fWtggttgagoatgacagggagg 

2? " 

£S£fhu !!!!!!??!?^ a «ccacgctc^gcgattcaggg i gcataggagg 

SSaSjM 2M i !!!!!!!!!!!!!!!!!!! a !^!^sr^««*agcaagaaaatacca 

2901 «ftaaagccagtgccactggotgttoccctcg^ 

S^^Th* 295 i ?! a !!??!!?!!^*W^ t 9 a t t "t«^atgaagaattcatggttct 
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genonic_hu 

ganoaio^ln* 

d>H\jrat 
geju?mia_h\i 

genoEioJbu 

cOMA_rat 
genomlojtra 

dDKA_rat 
geaomio_hu 

CDKfc_rat 
genoBic_hu 

* cDNA_rat 
genomic Jhv 

dDHArat 
g@naifli.e_hu 

oDN\_rat 
gmaalo_ha 

c*DNA_rat 
g*nofttc_hu 

gencai.c_hu 

oDHA_rat 
ganomic_hu 

geaoa±c_hu 

CDHA_r*t 
ganomia_hu 

eDNfcjcat 
genomic_hu 

dDNKjrat 
genomicjbu 

cDNA_rat 
ganoaic_hu 

eDNft_rat 
gano5ie_hu 

geno«ic_hu 
bDKfcjcat 

dDNA_rat 
gettOttie_hu 

ganoaio_hu 



3001 gaaaactaaggcttctggtgtcooaggaaggtcaccaactgctgactotg 

1 

3051 gaceagtaaatcatggttttatgaogagtatagettctggcacagaag1;c 

i . 

3101 tcaactgtgaatooacaaacactaoaatctgagcaecttootgatttcaa 

1 " 

3151 attatttagtgtaacaaacggtacagctgtgacaaagagtatgaacccat 

1 — 

3201 ooatagcaagcaaaatagaagatacaaooaaooaaaaeccaatoattatc 

1 — ~ ~ " 

3251 tttccatgagtagctgaaattogagattotgotoaggcaggaagagoato 

1 " ~~ 

3301 ttcccaaagtgcaoaooctgtaacagggggaaacatggctacctatggco 

1 " 

3351 atacoaacacatatagtagctttaccagcaaagccagtacagtcttgcag 

1 " *"* " ~ 

3401 ccaataaatccaacagaaagttatggacctcagatacctattacaggagt 

1 * 

3451 cagcagacotagcagtagtgacatctcttctcacactactgoagacceta 

1 - " 

3501 gcttctooagtoacccttcaggttcaoacaccaetgcotogtotttattt 

1 ' 

3551 oaoattcctagaaaoaacaataoaggtaacttoooottgtccaggcartt 

1 " — 

3601 gggaagagagaggacaatttggagcagagggagagttaaaaacccacata 

1 

3651 gaaccccagttotccgacggcatagaoacaggactgtgaggccagcaatc 

1 7 

3701 aagggacotgctaacaaaaatgtgagccaagttccagccacagagtaoco 

1 

3751 tgggatgtgccacacatgtccttccgcagaggggctcaoagtggotactg 

x 

3801 cagcactgtcagttccaagttcatcccaaagtgQoctccccaaaactiiat 

1 " 

3851 aatgttggggtoatagcagaagagtctaccactgtggtcaagaaacoaot 

1 ~ 

3901 gttactatttaaggaoaaaoaaaatgtagatattgagataataacaaeca 

1 

3951 ctacaaaatattocggaggggaaagtaaccaogtgattoctacggaagea 

1 

4001 agcatgacttctgctccaaoatctgtatccctggggaaatctcctgtaga 

1 

4051 caatagtggtcacctgagcatgcctgggaeeatceaaaetgggaaagatt 

1 

4101 cagtggaaacaacaccacttcccagccccctcagcacaccctcaataooa 

1 
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genoul> c__Htj 

cDHA_rat 
ga&omicjra 

cDN&^rat 

g&nOU3.0_bU 

cDHA_rat 

g&not&^Lc 

cDNA_rat 
yffionil Ira 

cDNA_rat 
gcnomLc 

geno&lo^fau 

ganomic_hu 

cDMArat 

genosilc_hu 

cDNA_rat 
geoo&£c_ha 

cOMArat 
genonie_hu 

<=DHA_rat 
geno»ic_hu 

geno*ie_hu 

csDKAjrat 
genoiic^hu 

cDKX_xat 
gencsxc tai 

cDNA_rat 
genomlcjhu 

genonicjhu 

g^nnini cjiu 

cDKArat 
ge&os&c Ira 

eDH*_rat 
gfttxo&lo fau 

gsaoaic 2ra 



415 

420 

425 

430 

435: 

440 

445 

450 

455 

460 

465 

470 

475 

400 

485; 

490 

495 

500 

505 

510 

615 

520 



acaagcacaaaattctcaaagaggaaaactccottgcaooagatctttgt 
aaataaccagaagaaggaggggatgttaaagaatecatatcaattcggtt 
tacaaaagaacecagccgcaaagcttcccaaaatagctcctottttaccc 
acaggtcagegttcceccteagattctacaactctcttgacaagtccgoc 
accagctctgtctaoaaoaatggotgccactcagaacaagggcactgaag 
tagtatcaggtgccagaagtctctcagcagggaagaagcagcccttcaoo 
aactootQtccagtgcttcctagcaccataagcaagagatctaatacatt 
aaacttcttgtcaacggaaacooooaoagtgacaagtcctactgctactg 
eatctgteattatgtctgaaacccaacgaacaagatccaaagaagoaaaa 
gaccaaataaaggggcctcggaagaacagaaacaacgcaaacaccacooc 

caggoaggtttctggctatagtgoatactcagctctaacaacagotgata 

cccccttggctttcagtcattccccacgacaagatgatggtggaaatgta 

agtgoagttgcttatcactcaacaacotctcttctggccataactgaact 

gtttgagaagtacacccagactttgggaaatacaacagctttggaaaeaa 

cgttgttgagcaaatcacaggagagtaocacagtgaaaagagcctcagao 

acaccaccaccactcctcagcagtggggcgcccccagtgcooactccttc 

cccacctccttttactaagggtgtggttacagacagcaaagtcacat<^g 

ctttccagatgacgtcaaatagagtggtcaccatatatgaatcttcangg 

cacaatacagatctgcagcaaccctcagcagaggctageceeaatoctga 

gatcataactggaaccactgactotooctctaatctgtttccatccactt 

ctgtgccagcactaagggtagataaaccacagaattctaaatggaagcco 

totccctggccagaacacaaatatcagctoaagtcatactccgaaaccat 
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g^aosii. o_hxi 

gonial e_hu 

-g©nomic_hu 
cDHA_rat 

geno*ic_hn 
<Stttt&_xat 

cDNA^rat 

dOHKjc*t 
genomie_hu 

aDNA_rat 
ge&oai.c_Jro 

oDMA_rat 
geno£ie_hu 

cOK\jrat 
genomlc^fau 

d>NA_rat 
gsnomio__hu 

gonoaic_hu 
oDKA_rat 

eDIUjrat 
genoaiqjhu 

gdtumic^hv 
^DHA^rat 

gon09L&0_llU 

cDHArat 
genomic 1#V 



525 

530 

535 

540 

545 

550 

555 

560 

565 

570 

57D 

580 

585 

590 

595 

600 

605 

610 

615 

620; 

625 



tgagaagggcaaaaggccagcagtaagcatgtecccccacctcagccttc 
cagaggccagcactcatgcctcacactggaataeacagaagcatgcagaa 
a&gagtgtttttgataagaaacctggtcaaaaoocaacttccaaacatct 
gccttacgtctctctacctaagactotattgaaaauagccaagaataattg 
gaggaaaggctgcaagctttacagttccagctaattcagaogtttttctt 
ccttgtgaggotgttggagacceactgcccatcatccactggaooagagt 
ttcatcagganttgaaatatcccaagggacacagaaaagccggttecaeg 
tgcttccoaatggcaccttgtccatccagagggtcagtattcaggaccgt 
ggacagtaeetgtgctctgcatttaatceaotgggcgtagaccattttca 
tgtctctttgtctgtggttttttacccggcaaggattttggacagacatg 
ccaagga^attacagttcactttggaagtagtgtggaactaaagtgcaga 
gtggagggtatgccgaggcctacggtttcctggatacttgcaaaccaaac 

ggtggtctcagaaacggccaagggaagcagaaaggtctgggtaacacctg 

r 

atggaacattgatcatctataatctgagtctttatgatcgtggtttttac 
aagtgtgtggccagcaacccatctggccaggatteactgttggttaagat 
acaagtcatcacagctccccctgtcattatagageaaaagaggeaageea 
tcgttggggttttaggtggaagtttgaaaotgccctgcactgcaaaagga 
actccccagcctagtgttcactgggtoctttatgatgggactgaactaaa 
accattgcagttgactcattccagatttttcttgtatccaaatggaactc 
tgtatataagaagcatcgctccttcagtgaggggcacttatgagtgcatt 
gccaccagctcctcaggctcagagagaagggtagtgattcttactgtgga 



22/85 



etg 
gat 
oca 



ret 

3CA 
SCA 

ITT 
kCI 

:cc 

.'CC 

CT 
CA 

CA 
CA 

TG 

IG 

rc 
5t 

AC 
AC 

GA 
SA 

aa 
it 

?r 

T 

'A 
A 

T 

C 



*0 

iu 



rat 


7351 


LO_hu 


890 


rat 


7401 






rat 


7451 


.ejhu 


990 


rat 


7501 


.e__hu 


1040 
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CttTGGAAGCAGAGTGACGGTCCATCCAAATGGAACCTTGGAgatgagga 
Ct aTGGAAGCAGAATCACAGTCCATAAAAATGGAACCTTGGAaat tagg a 

acatccGGCTTTCTGACTCTGCGGACTTCACCTGTGTGGttcggagcgag 
atgtgaGGCTTTCAGATTCAGCCGACTTTATCTGTGTGGcccgaaatgaa 

ggaGGAGAGAGTGTGTTGGTAGTGCAGTTAGAAGTCCTAGAAATGCTGAG 
ggtGGAGAGAGCGT GTTGGTAGTACAGTTAGAAGTACTGGAAATGCTG AG 

AAGACCAACATTCAGAAACCCATTCAACGAAAAagtcatcgcccaagctg 
AAGACCGACATTTAGAAATCCATTTAATGAAAAaatagttgcccagctgg 

gcaagcccgtAGCAGTGAACTGCTCTGTGGATGGGAACCCCCCACCTGAA 
gaaagtccacAGCATTGAATTGCTCTGTTGATGGTAACCCACCACCTGAA 

a 1 1 a cCT GGATCTTACCTGACGGCACACAGT TTGCTAACAGACCAca ca a 
at aa t CT GGATTTTACCAAATGGCACACGatt TTCCAATGGACGAcaaag 

ttccccGTATCTTGATGGCAGGCAATGGCTCTCTCATCCTTTACAAAgcaa 
1 1 a t caGTATCTGATAGCAAGCAATGGTTCTTTTATGATTTCTAAAaoaa 

ot cggaaca agt CAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGC 
ctcgggaggatgCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGC 

TAeatcgagaaactcatcctgttagagATTGGGCAGAAGCCAGTCATTCT 
TAtattgagaaattagtoatattagaaATTGGCCAGAAGCCAGTTATTCT 

GACATACGAACCAGGGAtggtgaagagcgTCAGTGGGGAACCGTTATCAC 
TACCTAT GCACGAGGGAoagt aaaaggoaT CAGTGGAQAATCTCTATCAC 

TGCATTGTGTGTCTGATGGgATCCCCAAGCCAAATGTGAAGTGGACTACA 
TGCATTGTGTGTCTGATGGaATCCCTAAGCCAAATATCAAATGGACTATG 

CCGGGTGGccATGTAATCGAC&GGCCTCAAGTGGATGGAAAATACATAcT 
CCAAGTGG t tATGT AGTAGACAGGCCTCAAATTAAT GGGAAATACAT A t T 

GCATGAAAATGGCACGCTGGTCATCAAAGCAACMCAGCtcacgaC(^iAG 
GCATGACAATGGCACCTTAGTGATTAAAGAAGCAACAGCttatgacagAG 

GAAATTATATCTGTAGGGCTCAAAACAGTGTTGGCCAggcagttattagc 
GAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCAtacaotgattact 

gt gtCAGTGATGGTTGTGGCCTACCCTCCCCGAAT CATAAAC t ac CtACC 
gttcCAGTAATGATTGTAGCCTACCCTCCCCGAATTACAAAtogtccifi.CC 

CAGGAACATGCTCAGGAGGACAGGGGAAGCCatgCAGCTCCACTGTG^'GG 
CAGGAGTATTGTCACCAGGACAGGGGCAGCCtttCAGCTCCACTGTGTGG 

CCTTGGGAATCCCCAAGCCAaAAGTCACCTGGGAGACGCCAAGAGACl'CC 
CCTTGGGAGTTCCCAAGCCAgAAATCACATGGGAGATGCCTGACCACl'CC 

CTGCTCTCAAaagcaacagcaagaaaacccCATAGAAGTGAGATGCTTCA 
CTTCTCTCAAcggoaagtaaagagaggaoaCATGGAAGTGAGCAGCTlCA 

CCCACAAGGTACGCT gGTGATTCAGAATCTCCAAACCTCGGATTCCGGag 
COTACAAGGTACCCTaGTCATTCAGAATCCCCAAACCTCCGATTCTGCiga 

tcTATAAGTGCAGAGCTCAGAACCTACTTGGgacTGATTACGCAACM^CT 
taTACAAATGCACAGCAAAGAACX:CACTTGGtagTGATTATGCAGCA«,CG 

TACATCCAGGTACTCTGAGAGGAAgggggagactaaaattoaacagaagt 
TATATTCAAGTAATCTGACATGAAataataaagtcaacaacatetgggca 
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cONX_rat 8401 ccacatccacagggTTTATTTTTTGGAAGAAGTTTAATGAAAGGCAGCCA 

0«noaio_hu 1940 gaa ■ — TTTATTTTTTGGAAGAAGTTTAATCAAAGGCAGCC& 

cD»rat 8451 TAGGCATGTAAAXGAgtcTG^TACATTTAGAGTATTMATTTACAATGS 

goxx<MLlo_hu 1979 TAGGCATGTAAATQAattTGAATACATTTACAGTATTAAATTTACAATCA 

CDNA_rat 8501 ACATGCgatga GACTTGTAAATGAAAGCATTGTGAACTGAaaccg 

genomic_hu 2029 ACATGCaaaataaaagGACTTGTAAATAAATGGATTATGAACTGAtgatii 

<sDMA_rat 6546 agtctctg TGGATCTCAAAGCAAACTCTTAACTTAAGGCACTTT»T 

geaottxo_hu 2079 otgatttatttaaTGGATCTCAAAACAAACTTTTAACTTAAGGCACTTTI; 

^-f a \ ™ MTTT ^^^TAATAACAAACAttaagagaaaaaaatgatcGACTA 

genciiejm 2129 ATTTTGCCAACAAATAACAATAAACAaaca 1 1 gaaa cggt t CACTA 

o£SHA_rat 8641 CGAAATAACAAACGGCTAATGGACCTGAATTctcagtaaaaagacctttc: 

genoaiojni 2175 TAAAATAACAAATGGCTAATGTACCT(SAATTtttcagtaaaaaaatgaac; 

COHA rat 8691 tctcgctaaoagttgCCAGCTGCCTCGTGTCTGTTTCCTACCAATGTCAC! 

ganomlcjra 2225 ttctaata CCAGTTGCCTAGTGTCCACCTCCTATCAATGTTAC: 

cOKA^rat 8741 AAACAt eg caca cagggtGAATGGAGTCAACGGGAAAGATTAAGTTTGCCf 

genomicjiu 2268 AAGCAtggcactca GAACAGAGACAATGGAAAATATTAAATCTGCfl. 

cDNAjrat 8791 GTCtgtgtaaatctca ATGTACAAATATTCTGtcncTG 

genoalcjra 2314 ATCtttatgatgtaaatttaccatcctgATGTATAAATATTTTG TO 

exonl (2342-2397) 

oOHArat 8791 GTCtgtgtaaatctca ATGTACAAATArTCTCtcncTG 

genoai<;_hu 23X4 ATCtttatgat^aaatttaccatoctgATGTATAAATATTTTG- TG 

eDNA_rat 8020 GTTTATAAACATTTTGATAAAACCG&AAAAAAA&AAAAaaaaaaaaaaaa 

B*no»ioJta 2360 GTTTATAAATTTTTTTGCTAAAACCIACAGAAAATAAQcactgaactgtc 

(Genomic human OCP: SEQ ID NO:4) 
(cDNA rat: SEQ ID NO:5) 
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Figure 1 1 
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608 Human translated nucleotide sequence (ORF) 

ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGT 

GATCTGCCTGGTCGCCACCCCTGGGGGCAAGGCCTGTCCTCGCCGCTGTG 

CCTGTTATATGCCTACGGAGGTACACTGCACATTTCGGTACCTGACTTCCA 

TCCCAGACAGCATCCCGCCCAATGTGGAACGCATCAATTTAGGATACAAC 

AGCTTGGTTAGATTGATGGAAACAGATTTTTCTGGCCTGACCAAACTGGA 

GTTACTCATGCTTCACAGCAATGGCATTCACACAATCCCTGACAAGACCTT 

CTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTATAATAAAGTCC 

GAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGATTG 

CACATGGACCACAACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGG 

GCTCAACTTTCTCCGCCTGGTGCACTTGGAAGGAAATCAGCTCACTAAGCT 

CCACCCAGATACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATCTC 

TTTCATTAAGTTCCTATACTTGTCTGATAACTTCCTGACCl'CCCTCCCTCAA 

GAGATGGTCTCCTATATGCCTGACCTAGACAGCCTTTACCTGCATGGAAA 

CCCATGGACCTGTGATTGCCATTTAAAGTGGTTGTCTGACTGGATACAGCC 

AGATGTAATAAAATGCAAAAAAGATAGAAGTCCCTCTAGTGCTCAGCAGT 

GTCCACTTTGCATGAACCCTAGGACTTCTAAAGGCAAGCCGTTAGCTATG 

GTCTCAGCTGCAGCTTTCCAGTGTGCCAAGCCAACCATTGACTCATCCCTG 

AAATCAAAGAGCCTGACTATTCTGGAAGACAGTAGTTCTGCTTTCATCTCT 

CCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGA^TATGACAGAT 

CAGTCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAG 

GACATCACCCATTGCATTCACTGAAGAAAATGACTACATCGTGCTAAATA 

CTTCATTTTCAACATTTTTGGTGTGCAACATAGATTACGGTCACATTCAGC 

CAGTGTGGCAAATTTTGGCTTTGTACAGTGATTCTCCTCTGATACTAGAAA 

GGAGCCACTTGCTTAGTGAAACACCGCAGCTCTATTACAAATATAAACAG 

GTGGCTCCTAAGCCTGAAGACATTTTTACCAACATAGAGGCAGATCTCAG 

AGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAGCTGAACA 

GAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCT 

CAAATCACTTTACCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGA 

CTATGATTTCAAGGGATAACAATACTAAGCTGGAACATACTGTCTTGGTA 

GGTGGAACCGTTGGCCTGAACTGCCCAGGCCAAGGAGACCCCACCCCACA 

CGTGGATTGGCTTCTAGCTGATGGAAGTAAAGTGAGAGCCCCTTATGTCA 

GTGAGGATGGACGGATCCTAATAGACAAAAGTGGAAAA1TGGAACTCCA 

GATGGCTGATAGTTTTGACACAGGCGTATATCACTGTATAAGCAGCAATT 

ATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAACCTTTGG 

TCGAAGCCTATCAGGAAAATGGGATTCATCACACAGTTrrCATTGGTGAA 

ACACTTGATCTTCCATGCCATTCTACTGGTATCCCAGATGCCTCTATTAGC 

TGGGTTATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAA 

AGTTCTAAACAATGGCACATTAAGAATATTACAGGTCACCCCGAAAGACC 

AAGGTTATTATCGCTGTGTGGCAGCCAACCCATCAGGGG'rTGATTTTTTGA 

TTTTCCAAGTTTCAGTCAAGATGAAAGGACAAAGGCCCTrGGAGCATGAT 

GGAGAAACAGAGGGATCTGGACTTGATGAGTCCAATCCTATTGCTCATCT 

TAAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTCjATGGAGGCTG 

AGG1TGGAAAACACACCTCAAGCACAAGTAAGAGGCACAACTATCGGGA 

ATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGrTTTAGGGAGA 

ATAGGAGGCATTrCCCl'CCCTCTGCTAGGAGAATTGACCCACAACATTGG 

GCGGCACTGTTGGAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAG 

AAAATACCACAGTGAGCCCACCCCCAGTGGTCACCCAACTCCCAAACATA 

CCTGGTGAAGAAGACGATTCCTCAGGCATGCTCGCTCTACATGAGGAATT 




TATGGTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGGACAGTGACTG 

CTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATAAATTATGGC 

ACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAACCC 

ACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAA 

TATAAACCCAACCATGTCAAGCCAAATACAAGGCACAACICAATCAACATT 

CATCCACTGTCTTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTG 

ACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCCCCAATAAC 

AGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCA 

CCAACAAACTATIATTAGAGTCAGTAAATACCACAAATAGTCATCAGACA 

TCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTCTATTCTCACAC 

TACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACAGCTGC 

TCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTT 

CAGACGCTTTGGGAGGCAGAGGAAAATTGGCGGAAGGGGGCGGATTATC 

AGCCCATATAGAACTCCAGTTCTGCGACGGCATAGATACAGCATTTTCAG 

GTCAACAACCAGAGGTTCTTCTGAAAAAAGCACTACTGCATTCTCAGCCA 

CAGTGCTCAATGTGACATGTCTGTCCTGTCTTCCCAGGGAGAGGCTCACCA 

CTGCCACAGCAGCATTGTCTTTTCCAAGTGCTGCTCCCATCACCTTCCCCA 

AAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCTACAACTCTAGTCCAG 

AATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAAACAACACC 

CACAATAAAATATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGTG 

CAGTCATGACATATGCTCCAACATCCATACCCATGGAAAAAACTCACAAA 

GTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGA 

TTCAGTGATTACATCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGAC 

TATTATAGCCATTACAAGGTTTTCAAGAAGGAAAATTCCCTGGCAACAGA 

ACTTTGTAAATAACCATAACCCAAAAGGCAGATTAAGGAATCAACATAAA 

GTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCT 

TTACCACAGAGACAAAGTTCCCCT1TCCATTTCACCACACTTTCAACAAGT 

GTGATGCAAATrCCATCTAATACCTTGACTACCGCTCACCACACTACGACC 

AAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCCTTCCC 

ACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAA 

AAAGCATCATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTC 

CCTGCATCTGTCATCACTTATGAAACCCAAACAGAGAGATCTAGAGCACA 

AACAATACAAAGAGAACAGGAGCCTCAAAAGAAGAACAGGACTGACCCA 

AACATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACA 

CCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAAAACACAACTGGGATT 

TCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACAGATGTGATT 

G AAGA ACTAGCC C AAG C AA GTACTCA G ACTTTG AAG AGC AC AATTGCTTC 

TGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAG 

CATCATTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAA 

TGCCAGTTCCCATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACAC 

GTGGCGACTCCCATTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCAC 

GAATCCTCAAGGCACAATCTCCAAATGCCAAGTTCACAATTGGAACCACT 

CACTTCATCTACCTCTAATCTGTTACATTCTACTCCCATGCCAGCACTAAC 

AACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGGGCAGAAT 

ACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAG 

CCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCT 

TGTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATA 

AGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTT 

TGTCTAGGTATATATTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCA 

AGTTTTACTATTCCAGCTAACTCAGATGCCTTTCTTCCCTGTGAAGCTGTT 



GGAAATCCCCTGCCCACCATTCATTGGACCAGAGTTTCAGGACTTGATTTA 

TCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCT 

GTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCG 

CATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTGGTTT 

CCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACAGTTCAT 

TCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCC 

CTACAGTTACCTGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCC 

AGGGAAGTAGGCAGGCTGTGGTGACGGTTGACGGAACATTGGTCCTCCAC 

AATCTCAGTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCAACCCA 

GGTGGCCAGGATrCACTGCTGGTTAAAATACAAGTCATTGCAGCACCACC 

TGTTATTCTAGAGCAAAGGAGGCAAGTCATTGTAGGCACTTGGGGTGAAA 

GTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGCGTTTACT 

GGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCCA 

AGTTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTT 

CAGACAGGGGCACTTATGAATGCATTGCTACCAGTTCCACTGGTTCGGAG 

CGAAGAGTAGTAATGCTTACAATGGAAGAGCGAGTGACCAGCCCCAGGA 

TAGAAGCTGCATCCCAGAAAAGGACTGAAGTGAATTTTGGGGACAAATTA 

CTACTGAACTGCTCAGCCACTGGGGAGCCCAAACCCCAAATAATGTGGAG 

GTTACCATCCAAGGCTGTGGTCGACCAGTGGAGCTGGATCCACGTCTACC 

CTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGACAGTGGTGTC 

TACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATGCA 

TGTTAGCCTAAGACTGAAACCTGCCAAAATTGACCACAAGCAGTATTTTA 

GAAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGCAAAGCTTCC 

GGCTCCCCAGTGCCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGAT 

CAACAATGCAATGCAAGCCGATGACAGTGGCCACAGGACTAGGAGATAT 

ACCCTTTTCAACAATGGAACTTTATACTTCAACAAAGTTGGGGTAGCGGA 

GGAAGGAGATTATACTTGCTATGCCCAGAACACCCTAGGGAAAGATGAA 

ATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATAAGGCAGAG 

TAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGACT 

GTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCC 

AATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGG 

TCTTTGACCATCAACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATG 

TGTAGCCCGAAATCCCAGTGGGGATGACACCAAAATGTACAAACTGGATG 

TGGTCTCTAA ACCTCC ATT A ATC AATG GTCTGTATAC AA AC AG AACTGTTA 

TrAAAGCCACAGCTGTGAGACATTCCAAAAAACACTTTGACTGCAGAGCT 

GAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTT 

CCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAATGGAA 

CCTTGGAAATTACiGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTG 

TGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTA 

CTGGAAATGCTGAGAAGACCGACATTTAGAAATCCATTTAATGAAAAAAT 

AGTTGCCCAGCTGGGAAAGTCCACAGCATTGAATTGCTCTGTTGATGGTA 

ACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCCA 

ATGGACCACAAAGTTATCAGTATCTGATAGCAAGCAATGGTTCTTTTATCA 

TTTCTAAAACAACTCGGGAGGATGCAGGAAAATATCGCTGTGCAGCTAGG 

AATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATTGGCCAGAA 

GCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAG 

AATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCA 

AATGGACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGG 

AAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGC 

TTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATA 



CACTGATTACTGTTCCAGTAATGATTGTAGCCTACCCTCCCCGAATTACAA 

ATCGTCCACCCAGGAGTATTGTCACCAGGACAGGGGCAGCCTTTCAGCTC 

CACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATGCC 

TGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGAAGTG 

AGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCC 

GATTCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTA 

TGCAGCAACGTATATTCAAGTAATCTGA 
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Figure 14 



rat_oDNA 

human_5+3_corrected 
Was cDNA 5 



C^G&£ACGA£AGAAGGTTA0G5C^ 



-CAGAAGGGTCCAGGAAA-GGAAA 



rat_cDHA 

huraan_5+3_corrected 
mus cDNA 5 



rat_cDNA 

human_5+3_correcteci 
mus CDNA T 



rat_cDNA 

huaan_5+3_oorreoted 
mis cDNA 5 



ratjsDHA 

hm^_5+3_corrected 
mus cDNA 5 



**t_cDNA 

human_S +■ 3_cor rec ted 
mus CDNA 3 



rat_CDNA 

human_5+3_corrected 
taus CDMA 5 



rat_oDNA 

h.uman_5+3 corrected 
mis cWk 5 



rat^cDNA 

haman^5i-3_corrected 
BUS cDNA 5 



rat_cDNA 

huraan_5+3_correeted 
was CDNA 5 



rat_cDNA 

humak^5+3__corrected 
mus cDNA 5 



rat_cDNA 

lxuman_5^3_corrftctcd 
mua_cDNA_5 



rat_cDNA 

human_5+3_corrected 
mus cDKA 5 



QTGCTGGAGGGGftOTC^GSAC3^AAAGCAGCGftCCA^^ 



GTACTGGAGGG<3A<?ITG^ 

GCCAGGCAAWUXCGQSGGAAaM 
GCCAGGCAAAAGGAGCGGGAAG<^ 

GACACGTTtACGCAAAATA-<3^ ga 

GACOTGTTTGTGCAAAQCAAGauaOT 

GttMTGATEACTJWTTTtX^ 

<H?GATCGiU , TAGTACTTGTAAGTTTAGGTGA<^TT GAGAACTAACTAACX^ATACTA 

TTGAGQGACGA^AQAQCATTAO^GTTT^ 

TTGAGC^AGAAGGAAGAGCATT ' — CCAGGAGCAGCAQGAAAGCrTTGGTTAGT 

TTGGAAATGGA3^TACCATTAAAATAAC^^ 



TWKSAAMGTATGMACCAT^^ 
CCCOCAGCTGAAAOCCAGAAAAjQ^^^ 

CCCCCAGCT A^TGTMQCCTACTAA^TTTTGC^^^ 

AAGAACTrrCCTTCCCT<^GOT^ 



AAfiAACGTTCCTTCAATCA£3GTGAAfiGCTCTCCXCAQAAQAOT 

^CRACA<*3AATCAftAACCA^ 

ATGAAGGTAAAAiGGCAGAGGAA 

TTACAAGA^rra&AAGC^^ 

*** ** * ****** * * 

TC&K^XOTGTTGATCTrc 

TCAOCTGCTTGCTGGTCXCCTTTiKrTQTGAXCTGCCTGG 
TCAGCTGCTTGCTGATCTCCC^ 

*** * ***** ** ***** * *** ********** * ********** *«, ** 



CCT<5TCCTC(^CGCTGT^^ 
CCTGTCCT(»(X^GTGC(^mSA^^ 
TCTGTCCTCGCCX^lCTQCCrG^ 
************ ******** *** **** ** ***** ***** ********* *** 

TGACTTCCATCCCAGAOIG^^ 
T(5ACCTCmTCCX3U3ACGGGCM 

**** *********** * ****** *********«*,* * ******** ** *** 
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huioSn_5 +3_co r rect ed 
BUS CENA 5 



AGCCTTACT*V5ATTG2VCAlSAAAAOG^ 

AGCTTGGTTAGAT TGAlOGftAACAGATTTTTCTGGOCTGft)^^ 

AG<XTCACTAl(yUtf<^^ 

* ***+ ** *** ******** ** **************** 



rat_cDNA 

huaan_5+3_correcte<J 
ocas cDNA 5 



CTGGAQUMWVTGGCOTTCACAGA^ 
CTTCACAGCAATGGCMTCAC^^ 

** ***** ************* * ** *************** * ****** ***** 



ratjCDNA 

hia»^_5+3_corrected 
mus cUNA 5 



C&GCTCW!AAAAATG*^^ 

*********************** ******* #* ** *** ******** ** ** 



ratj6t*3EA 

hvaaan_5+3_corrected 
mus cDNA 5 



CTCGGGAGCTTGGTCC(^FGCACCTGGA3CCftC^ 

CTGftGGAGCTTGACROC&TTQCA^ 

CTCAGGAGCTTG&££CGGTTGCACCT 

»** ******** ** ****** **** ******** ***** ** ** ***** *** 



human_5+ 3_CO rrec ted 
mus cD&A 5 



GCX^PmAIQGACTTACCTCGCri^^ 

GTTTTTTATGGa^CAACTTTCTC^ 

GCGTTTTACGGftCTtf^ 

* ***** ** ** * ** ****** **** ** * ******** * *** ** *** 



rat_cBHA 

humaii_5+3_correoted 
mus cDNA 5 



CTCCATCmSUjACACATTTGT 

CTCGACCOlGAIPACATTTGTC^CTTTGAGCT 

CTCCATCX^GftCAC&TTTGTCTCTTTG 

***** ***** *********** ** ***** **************** *** ****** 



ratjafclA 

htafiaa_5+3_correoted 
mus cDNA 5 



AAGTACCTCTTCTTGT^^ 

AAGTTCCTAIftCtfTGTCTGATAACTTCCTHjACCT 

AJVSNACCTGTACTTGTAl^T^^ 

*** *** * ***** ********** * ************ **** ********** 



rat_cENA 
j ^ human^S+S^corrected 
SOUfl cDNA 5 



CMGCCMACCTAGAAWSCCT^ 

TATGCCAAACClttGAAAG^^ 
***** ******* ***** ** ************************* ******** 



rat^cDNA 

huaan_5+3_corr€*oted 
mud cDfclA 5 



AAAGTGGTTGTCT<^GTGGATGCW3GGAAACCCAQAI 
JUUUSTGGTCGTCTC^CTGGA^^ 

AAAiCT<3GTTGTCCGAGT<^TGC^^ TXGTTTG 

************ ** ***** **** **** ****** * * 



rat_cDNA 

human_5+3_CQ rreeted 
aus cDNA 5 



AMCTCTTCCAGTCX^rCAGC^ 
AAGT(XCTC^7^GCTCAG<^^ 
imCTTTTTTTATARKAOTTATTTTCCT^^ 

* * * * * *** * ** *** * * * **** * 



ratjoDNA 

hunSn_5+3_correated 
mod cDNA 5 



GACCCTTTGCTATGGTACX3ATCTGGAGC^ 
JUS^TTAGCTASGGTCI^ 

-CXXXMAACXrrCCCOCOCA CTTCCCTACCTACCCATTC — CCATTTTTT GGC 

** * * +* *** ** * * * * ***** 



rat_cDWA 

huaaa_5+3_corrected 
OUg CONA 5 



CCCTGAAATCAAAGAGCCTGACTATT^ 

CCTGGCATTCCCC . 

* * * ** 



rat_cDUA 

human_5+3_COrrft<rtftd 
mus cDNA 5 



AAGM17CATAGAAGCX^TGGCTCCn^5TOT 
JlAGGTTTCar(K5CACCCTTTG(3CTCOCTCACTTTGAATA 



r&t_cDNA 

huwaa_5+3 corrected 
1W aDHA 5 



AGGCCGACATGGTCTGTAGTiVrCX^^ 
AAGCTAACATGGTCTGC^U5TATTCARAAJSCCCTCAA^ 



• 
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rat^oDNA 

htuaan_5+3_corrected 
mus cDNA 5 



AAGAAAATGACTACAfCGT(Xm^ 



rat_cDNA 

bunan_5+3_co rrected 
fitUS oDNA 5 



ATTATAATCACATCCAGOCAJtn^G^ 
ATTACGGTCfcCAiraCA^ 



human_5+3_co rrected 



TACTAGAAAGGAAGCKCCAGCTT^^ 
TACTAQAAAOTVKX^CTTGCTTAS 



rat_cDKA 

human_5f3_corrected 
IQU6 cDNA 5 



TGGCTCTTAGG0CT(5AAQACftOTTTTACCftSC^ 
TGGCTCCTAAGCCTGAAGRCATTTO 



rat_cDNA 

human_5+ 3_co rrected 

JttUS CORA 5 



TTTGGTTOCAACAAGAAAAAATTGTCITG^^ 
CtlGOTTAATGOUVSACCAAATPlXXT^^ 



rat^cBNA 

human_5+3 corrected 
xaua cDNA 5 



CATTACAGATCCAGTTTTCCACTtaATG 
CAMACAGATCCACTACTCCA*^^ 



jC$t__cCNA 

himvan_5+3 — corrected 
mas cDKA 5 



OSGAGAGACtCAAATGGAGCATG^ 
CAGrGAAACA(»AATG<3^ 



rat^cCrtWl 

hunSn^5+3_correated 
sus cDKA 5 



TCXW^TGGCG(X3*CTATO 
TCTTGGTAGGXtS^CCCT!^^ 



rat_cDNA 

human_5 + 3^co r rec tod 
mis cDNA 5 



0K3<^TG<XZrTCXAaCTGM 
TO*AT?GGCTTCTAGCT^^ 



rat_cDHA 

huaatn_5+3_coxreote<i 
mas cDKA b 



(aATCCTAAEAiGAGAAAAAIGGG^^ 

g^tcctaaiagacaaaagtggaaaattggaa^ ttgacacag 



rat_cDNA 

bunaii^S+S^correctcd 
mus cDNA 5 



GTCTTTACCACTQGAIAW3CACCAATGATGCAGM 
C5<!X3TATAICACTGTATAAfiCAGCMTTAT<3AJGAT 



rat__cDNA 

hu»an_5+3_c©rreoted 
sous cDNA 5 



CHSTGCTA6AG0CX!TATGGAGawA^CACAr^^ 
CTOTGGTAGAACCTTTGGTCGAAGCCTA^ 



rat_cONA 

huaaaJS+S^co rrected 
sous cDNA 5 



COGOTGAGAOGCTCXjACCTTOCArOCCTTTC^^ 
TO3OTGAAACACXT<3AirCTTCC^^ 



rat_cDNA 

humon_5+3_corrected 



GGATTCTTCCAGC^AACACTGTtS^^ 
GGGTTAITCCAGGAAACAATGTGCrCTATC^^ 
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rat_oDNA 

hwan_5+3_correcte<a 

WHS CCWA_5 



ret cDNA 

human_5+3_correctea 
nrus_©XJNJV_5 



rat CDNR 

human_5+3_corrected 
mis cONA_5 



rat_cDttfc 

taumaa_5+3 corrected 



rat cDNA 

buaian_5+3_corr<»ot«»<l 



rat_cEWk 

huntan^S+3 corrected 
bus cDNA_? 



rat cDNA 

human_5+3_correo£ed 
mus cWAJ 



human_5+3_corrected 



rat ct>NA 

h\ro5n_5+3_correoted 



rat_cDNA 

human_5+3_oorrectea 
mus CDNAJ5 



rat^oDNA 

h\H&an_5+3__c©rreoted 
zaus_CDNAJ5 



rat_cDNA 

huma»_5+3_eorrectad 

XBU0 cENA_S 



rat_cwcv 

huran_5+3_c©rr«cted 



rat cDNA 

taaaA_5+3„corrected 
imisjcDMAjB 



CTATGCCAGAG&VGO 



ITGGCTGTTCCCX^CG 




34/85 



rat^cDNA 

humaii_5+3_corrected 
euls_CDN7\_5 



. CAGGCA GGAAiSAjGCMCTT^^ 



rat_ctNA 

huraan_54- 3_corrected 
BUS CDNA 5 



ACCTATGGCC^TACC^CAC^^ 

JVAAGMGOTC^GT0^3«ATGCT TO* 6 



rat_cDNA 

hua5n_5+3jcorrected 
mud cDNA 5 



rat_oDNA 

huaian — 5+3 — corrected 
WIS oDNA 5 



CTAQCAGTAGTtSACATCTCtTCT^ 
CCAGGCACftATCACTTCTATTCTCACaCTACT 



rat_oDNA 



CAGGTTCACMMCACTGCCTCGTCTTtA^ 



hus>an_5+3_e6rrected 

was CDMA. 5 



ACATCCCGCT6TTCRjGA0GCTTTGG<^^ 



rat_cDNA 

humaa — 5+3_corrected 
jous CDNA 5 



AAAACXX1AGATAGAACXXCAOT 

TCA)GC0CATATAGftACT0CAGTTCrGCQAOGGCA!IAGAIT 



rat_oPHA 

hOTan_5+3_corrected 
mus cDNA 5 



XCftAGGGACCTGCTAACAAAAA!^^ 
CXWaGGTTCTTCTGAAftAAWSCACT^ 



r«t_cDNA 

hiaaaD_5+3_corr*cted 

cDNA 5 



GCCAraC^GTCCTTCCGC^ 



rat^cBNA 

huroan_5+3_corrected 
mua aDNA 5 



GTTCATCCCACAGTGO^^ 
(3T<K?TGCrCCCATCACCTTC0^ 



rat_cDNA 

hravan_5 + 3_eo rrect ed 
mus cDHA 5 



CCACTOTGOTCAAGAAROCA^^ 
CAACTCTAlSTCX^Q^CCACTSOT 



rat_cDWA 

huroan_5+3_eorrectftd 
rms cDNA 5 



TMTAACAAOCRCTAfcAAAAT^^ 
jiNACAACACCCHVCAAIAAJ^^ 



rat_cDNA 

h.uman_5+ 3_co rrected 
was cDNA 5 



(^AGCAT(^CTTWCTCCAACATCTG^ 
CAGTCMGACATATQCrrCCAACATCaU^ 



rat_CDNA 

human_5+3_co2rrected 

mus cDNA 5 



OXCACCTGRGGAIGCCTGGGA^^ 
GTXACraCGTCTOTC^ 
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rat_ctft*A 

toa»ai*_5+3_oorrected 
mis cDN&_5 



rat^cDNA 

htfflian_5+3_corrected 



rat cDNA 

Iunnan_5 + 3_co r rect ed 
nnis_cPNft_5 



rat cDNA 

human_5+3_corrected 
Xjcus cDUA__5 



rat cDNA 

h>anian_5+3_correctftd 
mus cDNA_5 



rat_cDNA 

hvaoan_5+3 corrected 



rat cDNA 

huitiaa_5+3_COrreatad 



rat cWh 

human_5+3_correoted 
nms c5na_5 



rat_cBNA 

human_5+3_correated 



rat_cDUA 

kuaanJ5+3_correctad 



rat cDNA 

human_5+3_correcte<l 



rat_cEWA 

buman_5 + 3_crf> r rect £d 
mas cDU&_5 



ret cDNA 

huiaan__5+ 3_porrected 



rat cDtfA 

hun3n_5+3 corrected 
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ratjcDNft 

human_5 4- 3_co rr ect ed 
tons cDNft_5 



rat oCWA 

kunw_5+3_oorr«ctAd 

SLUJ9 CfNA 5 



rat_oot9A 

human_5+3_oo rrected 
mus cDNA_5 



rat cDNA 

hunSn_5+3_corr&oted 
mus cDNA^ 



rat cDNA 

buflSa_5 + 3 - ccrreoted 



human_5+3_correoted 
mus cDNA_5 



rat cDNA 

h«t5a_5+3_correcte<i 
mus cENRjS 



rat__pDNft 

humam_5+3_oorrected 
sms oDNA 5 



rat_cDNA 

human_5+3^oorrected 



rat cDNA 

huiaan__5+3_eorrected 



fcuman_5 4-3 — corrected 
mis cDNA 5 



rat oDNA 

human_5+3„corrected 



rat_cDNA 

human_5+3 — corrected 



rat cPNft 

human_5+3_corrected 



CTGTGGT 

CTGTGGTTTCCTATCCX^ 
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rat cDNA. 

human_5 + 3_C0 rrected 
naifi CdSk 5 



rat cDNfc 

hutt£n_5+ 3_corrected 



xat_oDNA 

hu»an_5+3_corrected 

jttUS ODNA 5 



rat_cPMA 

hiaioan_5+3_Gorrected 
mus cENA 5 



ret cDNA 

h\2flSn_5+ 3_corrected 
mus cDNA 5. 



rat oDHA 

human_5+ 3_corr ect ed 
mus cDNA_5 



rat — cDNR 

human 5+3 corrected 



fflus_cDKA_ 5 



rat_cDNA 

human_5+3 — corrected 
mus cDNA_5 



rat_cDNA 

human_5+3 corrected 



rat^cDNA 

htanan^S+S^correoted 
mus cDNA 5 



GTTTGaA^GCCCTGOKWKW^^ 



rat_CDHA 

kumanJS -»-3 — corrected 



rat_cDNA 

human_5+3_C©rr acted 
znus cD&ft 5 



^M^T<3<^3ftCGAGTG<3 GCAGCTGC^CCACGTCTA(XCTA^G 



rat__cDtflft 

human_5+3_co rrected 
maw cDNA_5 



rat_cDN& 

human_5+ 3_co r r ect ed 

mUS_CP*C\J5 



GATCXTT<KTGGTTGC36TCAQTQACGGftiVAAA 
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rat_cDNA 

humaa_5+3_correoted 



rat ct3NA 
srus_ cDNA_5 

rat CDMA 

buman_5+ 3_co r rsctea 
nus_cDKA_5 

rat_cD13A 

human_5+3_corrected 



mt cDKA 

kum£m_5+3_oorrected - 

IttUS__CDNA_5 



huaan_5+3_corrected 
nus cDNA_5 



rat OOHA 

hum^_5+3jooi:rect*<i 
unis C0NA_5 



rat_cPHA 

hu»an_5+3_e©rrfccted 
mua oDNA_5 



rat_CDNA 

human_5+3 -4 Corrected 



rat_cDNA 

human_5+3_oorrectea 



rat_CDN& 

human^+S corrected 



rat cDNA 

k\Bi£n_5+ 3_corr«ctea 
mas cBNA 5 



rat CCNA 

hun£w - 5+3_correctfta 



rat_cD*& 

human_5+3_corrected 
exu£_cDNA_5 



GCTCtO 



CCA 1 



•<3a6GCTCAfiSGCTGGAG?VftACAG 



CTGCTCCOO^MGGCAflAG^^ 



MGCCAGGCftATA 

'A 
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ratjcEHA- 

human_5+3 corrected 



rat_cD&SA 

human 5+3_corrscted 
mus cDNA 5 



rat_cDNA 

human_5+ 3_co r rect ed 
mus cDNA 5 



OCTCTGTGGA!f<KK^^ 
GCTCTGST<MGGTAACCC^^ 



rat.cDNA 

hun£ri_5+3_correoted 
noi3 CDNA $ 



TTTCCMTGGACraCAAAGTTAl^ 



rat_oDNA 

huaan_5 +3_co rr acted 
toue cDNA 5 



rat^ctftBV 

human_5+3 — corrected 
iaus ct)MA 5 



ATATTGAGAAATtJVGTCAXATl^^ 



rat_cDNA 

hu«an_5+3_corr^cted 
»ue oDNA 5 



hu»an_5+3_corrected 
mas cDNA S 



TCCCCAAGCCAAATGTCAAG^^ 
TCCCTAAGCCAAATATO^TGG?^ 



rat_cDHA 

hunan_5+3_corrccted 
X0U6 CDNA 5 



TGG&TGGAAAATACATACT^ 
TTA&TGGGAAAXACMATT<^^ 



rat_cDNA 

huinan — 5+3^corrccted 
mus cDNA 5 



ACGACOAGGAAATTAI^^ 
ATGACAGAGGAAACTATATCTGTAAGGCT 



rat_cDNA 

human__5+3_corrected 
mu* cDNA 5 



TGTCRGWATGGTTGTGGCCTACO^ 
XTOCI&flTARJMI^TGTRGOCTWXX 



rat_cDNA 

human_5+ 3^oo rrec ted 
inns CDNA 5 



TCAGGAGGACAGGGGA^^ 
TGACXiAjGGACAGG^CAGCCTTTCAGCTCCACT 



rat_CDNA 

human 5+3_corrected 
Wis cDNA 5 



AAfiTCACCTQ<5GAGACGCX3tf^ 
AAATCACAT GGGftGAlWCCTGACC&C^ 



rat_cDHA 

huaan_5+3_correct©d 

mi3 cDNA 5 



ATWSAACTGAGATGCTT^ 
JVTG<3tfU3TGAl^<3CT^^ 
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xat_cDNA 

hUE«Ln_5+ 3__cor rec t ed 



rat otfflA 

Jiuman_5+3_correCted 



rat_cDNA 

huaan_5+ 3_co rr«c ted 



rat_cDNA 

human_5+3 corrected 



htnpan^5+3_corrected 



rat cDNR 

human_5+ 3_co r rected 



rat_cDNA 

h\OTan_5+3_oorrectod 



rat cDNA 

humaa_5+3_oorrected 
mus cDNA_5 



ACATCCRSCTACICT6ACRGGBft»S<^ 



^ttctGAACTGAA ACCGAGICTCIG — TGOiVSCTCRAAGCRRRCTCTTMCTTAA 



G«3UCTTTTATTTTGCC3>*^^ ACRTTGAAACGGTlCfiCTAT 



ssssssssssssssssss^ 



rat.ctiNA 

hvcwtn_5+3_corrected 
mis cWA 5 



h\a8»n_5+3_corrected 

XKUS oDNAJS 



rat_cDHA 

human_5+ 3^co rr%Cted 
©us cDNA 5 



ssssafflsssss===assssss 



AftflA 



(rat_cDNA: SEQIDNO:7) 
(human_5+3 corrected: SEQ IDNO:8) 
(mus_cDNA_5: SEQ ID NO:9) 
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Figure 15 




rat 

s ^ human_5+3_corrected 
^ Kouse"6_corrected 

j - h.uman__5+3__correcte<a 
\ fl moua e^5_co rrec ted 

sS 3 

^ rat 

\ Q kuman_5+3_cortected 
ffi ouse_5_oorrected 



, human_5+3_corrected 
1 ^ xaouse_5_corrected 

^ rat 

1 ^ hu»aa_5 + 3_co erected 

»0U0e_5_CO3:rectftd 

rat 

hmftaa_5+ 3_oorr ected 
"5 corrected 



busvan_5+3_corrected 
mouse"5_cSrrected 



JUU UO V. «^ - — 

TOuse^corrected 
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rat 

human_5-f3_correcre<3 
mouse's corrected 



PIAVPLVELTD&EKDASGMXPFDETCftWIJ^ 
PWTQLPNIPGEEDDSSOMLA^ 



rat 

human_5+ 3_co r r eo ted 
mouse 5 corrected 



&VS-TVNPQTLG0EHXi£DFKLFSVT^ SV 

EFSPVVNSQILITEEPTDFKLSTAIKTTA^^ 



rat 

hu»an_5+3_corrected 
mouse 5 corrected 



AEIWDSAQRGI^--S0SAHmG(3W^^ 

TEFQDS DCKGRGREHFQSRPP ITVRIMIKOVHVKMLS STTNKL — LLESVNTTNSH — QX 



rat 

human_5+3 — corrected 
»OU$ft"5 corrected 



PlT(^SItfSSSDISSHrTADPSF£SHPSGSHTTAS$X,FHIPB^^ 
SVREVSEPRHNHFYSflTT0XI'STSTFP3DPHTAAHSQFPIPRNS-TVKlPLFE^GRQ 



rat 

hnman_5+3_correoted 
mouse 5 corrected 



IWSRjC^VKNPHRTPVLRRHRHRIVR^ 
I«n&RlISmTPVLRRHRYSIFR$T^ 



rat 

human__5+3_corrected 
mouse 5 corrected 



TAAXjSVPSSSHSALPKTNNVGVIAEESTTVVK^ 
tAAI^FPSAAPITFFKADIARVPSEES^^ 



rat 

human_54-3_corrected 
mouse 5 corrected 



rat 

human_S+3_oorreated 
mouse~5 corrected 



imVIPTEASMTSAPTSVSLGKSFVD^ 
SOVTPSGAVMTtAETSIIMEmKVNASYra^ 



TSTKFSKIOTPIfiQIFYIWQSOTGWLKNFYQ^^ 

AI TfcFSRRKI PWQQNFVNNHN PKG£I*RNQHKVSLQKS TAVMLPBCT SP ALPQRQ SS PFHFT 



rat 

hunan_5+3_corrected 
mouse 5 corrected 



TLLTSFPPALSTTMAATCNKGTEW 

TLSTS VMQI PSNTI,rTAHHTTTKTHNW- SLPT~KK£LPFPPLNPMIiPS ITSKDS STKS I 



rat 

humAn_5+3^corrected 
mouse 5 corrected 



LSTETPT-VTSPTATASVIMSETQ^ 
ISTQTAIPATTPTFPA6VITYETQTERS^^ 



ret 

huroan^5+3_corrected 
mouse~5 corrected 



SALTTADTPIAFSHSPRQDIKJOTV'SAVAYHSTTS — IiI^T£LrcKXTQTC«GtfrXMjETT 
TJOTP— PAUUTHSPPENTCGISSriSFHSRII^TDTO 



rat 

fauman_5+3_corrected 
aouse"s corrected 



I£SKSQE£TTVKIU^DTP-PFLL$SGAPP^ 
ISSKSHQSTTTKKASIOTPIPPFLSSSATLMPVPISPPfTQIWT 



rat 

h un \an_5 + 3_co rrected 
aouse~5 corrected 



VTryESSRHOTDLraPSAEASPl^ 

VKLHE S$RHt* — L#4PS SQLE P J<TSeTSNLLHSTPMEftLTTVKSQN3KLTPSP 



rat 

huroan_5+ 3_co rrected 
moused corrected 



WAST^HVmKPYSDIAEKGK^EVSMtATTGLSE^ 
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huaan_5+3_corrected TTSKLlpfdslsryifeKPRIVGgkaas^ 

TOuse w 5_corrected — * 

rat EISQGTQKSRFHVLP^IXSIQR^ 

human_5+3_corrected DI^RG^SRVQVW'NGTI^I^ 

aouee_5_corrected - 

rat BHVKBITTOraSTVEIiCC^^ 

hunanJ5+ 3_co rrected RRTKEITVHSGSTVEIiKCRAEGRPSprVTWI 

mouse~5_dorrected : — —————— — 

rat LSLYDIU?r]tKCVAS>^fiG0D3IXVKZOVllA^P'VI IE0KROAIVGVLGGSLKLPCTAK3T 

human_5+3_corrected LSIYDRGFYKC^ASNPGGQDSIJiVra^ 

mouse_5_corrected • 

rat PQP£n?HWVLYIX3TELKPLQLTESFrFX 

human_5+3_COrrected PQPSVYWVLSDGTEVPS^O^SiarLFSlK^ 

mouse 5 corrected — 



rat VTLTVEEGETIFRIET&SQXflTEVK^ 

human_5+3_corrected VML WEERVTSPRIEAMQKRTEVNFGORCliNCSM - - 
mouse"*5 corrected 1 



rat GSRIH^nftWGSLVVGSVTEKnMraiCVAFKIW 
humaii_5+3_COrrectOd GS^HVYPNGSLFIGSVTEKD$<^na^ABNK^ 

mouse 5 corrected 



rat viBGRDFgsttxa^^ 

human_5+3_COrreote l d VLHOKDFQyDCKASGSPVPEISWSLPDGTMIlJKfi^^ 

mouse 5 corrected — « — — ™n 



rat ««?VEE<^YIC^^^IX^EMKVHLTVLTAI PRIRQSVKTTMRLRAGETAVLDCEVTGEP 

hU«an - 5+3_OOrrectod GVAEEGDYTCY^QNTKKDEMKV^^ 

mouse""S corrected — * — . — : 



rat KPNVF9ai*SNNVISFSNDRm^ 
human_5+3_COrrected KPKIfTOJjPSNDMISFSIDIOTFHANGSLTINK^ 

aouae 5 corrected — „. . 



rat IV$RPPLINGLXANKTV:na^ 
huiuan_5+3_corrected VVSKPPLINGLYTNRTVIKAIAVEHSKKHFI^^ 

aouse 5 corrected — — ^ 



rat VTVHFNGTLEMRNIRLSDSJUJCTC^ 
hutaan_5+3_oorrected ITVHiaJ6TLEI»OTRX^06M>FICVARNEGGESVLVVa^ 

mouse" 5 corrected — . 



rat QWOTVAIiNCSVDGNPPPEITWILFIX^ 
human_5+3_corrected QWKSTAItfCSVDGNPPPEIIWILPN^^ 

mo\W^ 5 corrected — — — — — . — — 



rat RCAftRNKVGYIEIXriiEIGQKPraim 
fauman_5+3_corrected RC2\ARNK^YIEKI,>m,EIGQKFVILTYA^ 

mouse 5 corrected 
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huflian_S+3_corrected $(?WVt&WT8G!!^^ 

mouse 5 corrected ~" 



I IFH,PRHWLRRTGEAMQIiHCVALG 
humn_5+3_correoted iTHRP^IVTETGattFC^^ 

mouse 5 corrected — — 



rat LVIQNLQTS DSGVYKCRA0NLL6TDYATT YTCV 
huxnan_5+3_corrected LVIQNPQTSDSGIYKCTAKNPLGSDYAATYIQV 
raouse_5_corrected ~~ 

(rat: SEQ ID NO: 10) 
(human_5+3_corrected: SEQ ID NO:l 1) 
(mouse_5_corrected: SEQ ID NO: 12) 



# 
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Figure 16 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

hunan_5+3_corrected 
rat 

human_5+3_correoted 



rat 

human 543 corrected 



rat 

huiaan_5+ 3_co r r eo ted 



rat 

human_5+3 corrected 



rat 

human 5+3 corrected 



rat 

human_5+3_oorrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

hwmaa_5+3_corrected 



rat 

human 5+3 corrected 



rat 

human_5+ 3 w correc ted 
tat 

human_5+3 corrected 



rat 

human 5+3 corrected 



MKVK^ITCXLVSrmCXVM 

*:*:** **;*:: # :***.***. ********+******* ** 

# * * 

*♦*«*****.*♦ *■**.**:*******+***** J, ***+*,**;***********, . 

RHTCFYGIX3SLVRI£LDHNNI^ 
QKDTFYGIJ*SLTItfjHMDHNNIEFINPEVFYGLN 

**.***J*«**+****+* f **A* t *********, **************** 

I FKT5 FI KYLFLS DNFLTSLPKEWSYMPNLESLYLHG^ 

IFKISnKFLYI^D»PLmPQiai7SyMPDIiD3IiYiaGNiW^ 

*** ****,*, ********** ^ ******* i*;****^**************.,.* 

********,**.****,**„» »:«»***.ir«*w. T?.. 

flAFI SPQQEWAPPGSLTLNMTDQSGNEANMVCS XQKP3RT3 PIAIfTEENDYIVLNTSFST 
** ***.*: *****;**** *********,**♦**** 

DYGHlQPVWQIlALYSDSPIiI LERSHLISETPQLYYKYKQVRPKPEDIFTNIEA 

DI^PSWLMQDQISI^LNRmTTFSTLQIQYSSDAQITLPI^^ 

*:«*** » : * ;: * fc********,******.*,****,******^ . ***** „ 

PKLERTVLVGOTIM,SCF<aCG^ 

TKLEHTVLVGGT^KCPGQGOPTPHVI^^ 
.***:*******:,*.+**!***;**♦ 

AP$ FDAGLYHCI STND&DADVLTYRITWE PYBESTHDSGVQHTWT GETL DLPCL3TGV 

ABSFDTGVYHCISSNYDDADILTnaTVVEPLVEAY^GIHHTm 
**>**.★.*****.* ************** *, • **+•;***_ ******** ***. 

P^ISWILPGOTVFSQPSRDRQII^GTL^^ 
PIMISWVIPGfcmVLYQ33M3KKVINN<OT^ 

♦**♦***. ;***.*. *.***.;.************ ***** i *,****+*** t ** *,* 

SVQKKGQRMVKHDREA^ 

«*. **** *. **** # # - * # m **,*, ? * .*** 

HRDL IHRI^GDSTLRRFRSHBKILPLSARRI DPQRKAALIiEKAKKNSVPKKQEHTTYKPV 
YRELTLQ^GDSTHWlFRjENRi^FPPSARRIDPQHW^ 

; + : * ******* ****♦,**,,* ********,***********. ^^i*^^ 

*IAVPLVELTDEEKD^GWIPPD£ER^^ 
PVVTQLPiTCPGEEDDSSGMLA^^ 
*:.. * ::.,**.*:***:. *#* 



**** . ***** 



BVS-TVNPQTt^BmPDFKUWTNGTAV^^ 
EFSWVNSQILPPEEPTDFXLSTAIKTTAMSK^ 

*.* .**.* * .«*** :- . **..*_ :**:::*;*-.***★;, «** 
TEFQDSDQMGRCaiEHFC^RPPITVRTiaKD^^ 

:* 2S ** * *+. ** *♦* # * ** * ^ 4 -A ^ <<fr< w 
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rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

humar^5+3_oorrected 



SVREVSEPm*HFYSHTTQIL$TSTFPS^ 

f? **,* **** * * * ****, * *;** *:;**:*, 

IWSl^VKNPHICTmitMRHElVRPAIKGPi^ 
IGGRCnaiSPYRTPVLRI^YSlFRS^^ 

* ,***: .*********** j .*.: :*,::*..! »*** * :* . * **.* 
TAALSFPSAAPITFPKADIMWPS^STTLV^ 

.*****,**,. ;* + *:: . * ,***«*.* j ;****;; ; * .;* *.* ** * 

jmVIPTEASOTSACTSVSti^PV^ 
SQVTPTGAVMT*APT8IPMEKI^^ 

<; * ** * «* ****•.; *. *..; s#t * :: -*** :* 



rat 

human 5+3 corrected 



*0 



*0 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



TLSTSVMQIPSNTLTXAHHTTTKTHHPG- SL 

** ** *.*:::itl. *:. . . **-: *: **• .*:*** *** *.* .ti 
STETPT-VTSFm*ASVIMSBTQOT^ 

STQTAIPATTt^TFPASVlTYETQTERSWlQT IQREQEPQIGCNRTDPNISPDQSSGFTTPT 
**:*♦ ,*:** .**+* *** **: . * : * t***,:.* :* * : 

ALT TiU^PXATSHSPRQDOG<^SAVAYH5TTS - -LIAI TELFEKYTQTLGtfTTALETTL 

am:p— pMjAFThsppeottgisstisfhsrtijN^ 

..***:*** :: * * -* ** : **** .* * **** 



rat 

human 5+3 corrected 



LSKSQESTTVSlAS-DTP-FPIiSSGAP^ 

S SKSHQSTTTfcKASIiDTPlPPn*SS SATlKPWISPPFTQ^VTDTRGDSRn^IWTNTVV 

:** . - * * J J J* ** 



rat 

human 5+3 corrected 



KLHESSRHN — LCMPSSQLEP LTSSTSNU^STPMPALTTVKSCfNSKLTPSPW 

,♦.****** ** **:: .* *.*.***: ♦*.:*** , *.*♦** ,**** 



rat 

human 5+3 corrected 



PKIKyQLKSYSETIKKGKRFAVS*^^ 
AEYQFiraKPVSOIAEKEmi^SMIATTGL^ 

«*?S2 *.**; ****** *** , ***** *:. 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



T SKHLPYY3LrKTLLKKPRl IGGKAAS FXVPANSDVFLPCEAVGDPLPI IHWTRVSS GXE 
TSKLLPFDSLSKTCFEKFRIVOGK^ 

*** ** ; **,* • ; ;****$******** j***W* _******** J*** ******* * ; 

.*.* 4 * : ** v ,************,*********** * : *_**j**j**** ** # +** : * 

KVKEITVHFGSTVELKCRVEGMPFPTVSS^^ 
RTKEITVRSGSTVIXKCRAEGItfSPTVra 

+ *********.** * ***.*********** | J J***: . ** ****♦;;*« 

SLYmC^TKCVASNPSGQDSLLV^ 
SIYCRGFYKCVMNPGGQDSIiVKiaVXARP^^ 

*$*************.************;*****.**«**,***. * ************ 



rat 

human 5+3 corrected 



QP3VHWVI,YDGTELKPLQLTHSRPFLY 
QPSVYWVLSDGTEVKPLQFTNSKLF^ 

******** •*#**.#***• *♦*-. ******* .* * ***********.******* 



rat 

human 5+3 corrected 



KLTMEttVTtSttW*^^ G 

* ********* ***** *.**»*******;***;*;********;** * 
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rat 

human_5+3__corrected 



rat 

hujnan_5+3_correoted 



rat 

human 5+3 correoted 



rat 

human 5+3 corrected 



rat 

human_5+3_corrected 



rat 

human 5+3 corrected 



rat 

huma»_S+3_corrected 



rat 

hu»an_5+3_correoted 



rat 

human_5+3_corrected 



rat 

human 5+3 corraoted 



rat 

human_5 + 3_corrected 



SBIHVYPN<3SLW<^VTEKD^Y^^ 
SWlHVYFHGSLFIGSOTEKDSGVttCV^^ 

* *********, : **«**»*i* *************;**++ ***.*****«****:*** 

LHGKDFQVOCKASGS PVPEVSWSIJDGTVI#Nl^^ 
LHGKDFQVDCK7VSGSPVPEI WSl&VGTtlXmMQM^ 

******************* s ********jt**. ****** ; ** : *****j*******;** 
VAEEGDYTC^QNTLGKDEMKVHLTVITiy^ 

: ****** * ******************* ****** **, *: : **3**********:** 

PKEFIfljLPSHDHIGFSI ERVTFHANGSLTINKVlOLLDSGEyVCVMtN? 
* ?: ******* :: **** **,***** .*,*,*** ****.*****. ******** 

SKPPL INGLYTNRTVI KATAVRHSKKHFDCRAEGTPSPEVMWIKPDNI FLTAPYYGSRl 
i**********^,*******.*****:*****!* **.;* ****,****.*++:***: 

TVEPNG^LEMRNIRLSDSADFTCVVItfEG 
!PraMGTl£rMJVM.fiI)SADFICVftI^^ 

*** *****;**♦******** a*.*.*****************************;:** 
AGKFV^CSVDGN?PPEITWI1PD^ 

LGKSTALUCSVMNPPPEI I^LPNGTRFSNGPOSVQYLIASNGS FI I SKTTREDAGitfR 

**_************** ****)**;♦;* ** : * b +**j*j *J**: . 

CaARKKyGYIEKLIIJxElGQKPmTYEPCam^ 
C&AFHKV(nriI^VII£lGQKFV^ 

*************. : ************ ** **,;***„**************•*** *, 
GYVVPWrNGKyiLHDNSTIiVIKE^ 

*.*.****- .******!******* •#*}*• ****** | | ******* 

INYLP3^MLRMGEAMQL8CVALGI PKPKVJ^ETPRBSLLSKAT^^RKPHRSEMLEPQGTL 
TtmPPRSIVTRTGAAFQIiK^MGVP^ 
* **,:: *** * t ********* ***• .*** * *****, *j .:.* ** ** **** 

VIQNLQTSDSGVYKCRAjQSOTiLGTDYMTYIQVIi 
VlCfNPQT3DSGIY!ra?AKNPIiGSinPftATYIQVI 

**+* ********** *j* **;♦**;**+**; 



(rat: SEQ ID NO: 13) 
(human_5+3_corrected: SEQ ID NO: 14) 
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Figure 17 

MQKRGREVSCLLISLTAICLVVTPGSRVCPRRCACYVPTEVHCTFRDLTSIPDGPANVER 
VNLGYNSLTRLTE>TOFSG1^R1£LLMLHSNGIHRVSDKTFSGLQSLQVLKMSYNKVQIIE 
KDTLYGLRSLTRLHLDHNNffiFINPEAFYGLTLLRLVHLEGNRLTKLHPDTFVSLSYLQIF 
KTSFIKXLYLYDNFTSLPKEMVSSMPNLESLYLHGNPWTCDCHLKWLSEWMQGNP 

(SEQ ID NO: 15) 

Figure 18 

MKVKGRGITCLLVSFAVICLVATPGGKACPRRCACYMPTEVHCTFRYLTSIPDSIPPNVE 

RINLGYNSLVRUvffiTDFSGLTKI^LLMLHSNGIFfriPDKTFSDLQALQVLKMSYNKVRK 

LQKDTFYGLRSLTRLHMDHNNIEFINPEVFYGLNFLRLVHLEGNQLTKLHPDTFVSLSYL 

QIFKISFIKFLYLSDNFLTSLPQEMSYMPDIJDSLLYLHGNPWTCDCHLRWLSDWIQPDVI 

KCKKDRSPSSAQQCPLCMNPRTSKGKPLAMVSAAAFQCAKPTIDSSLKSKSLTILEDSSS 

AnSPQGFMAPFGSLTLNMTDQSGNEANMVCSIQKPSRTSPIAFTEENDYIVLNTSFSTFL 

VCNIDYGfflQPVWQILALYSDSPLILERSHLLSETPQLYYKYKQVAPKPEDIFTNIEADLR 

ADPSWmQDQISLQLNRTATTFSTLQIQYSSDAQITLPRAEMRPVKHKWTMISRDNNTK 

LEHTVLVGGTVGLNCPGQGDPTPHVDWLLADGSKWAPYVSEDGRJLIDKSGKLELQM 

ADSFDTGVYHCISSNYDDADILTYR1TVVEPLVEAYQENGIHHTVFIGETLDLPCHSTGIP 

DAS IS W VIPGNN VLYQS SRDKKVLNNGTLRILQ VTPKDQG Y YRC V A ANPS G VDFLIFQ V 

SVKMKGQRPLEHDGETEGSGLDESNPIAHLBCEPPGAQLRTSALMEAEVGKHTSSTSKRH 

NYRELTLQRRGDSTHRRFRENRRHFPPSARRIDPQHWAALLEKAKKNAMPDKRENTTV 

SPPPVVTQLPNIPGEEDDSSGMLALHEEFMVPATKALNLPARTVTADSRTISDSPMTNIN 

YGTEFSPVVNSQILPPEEPTDFKI^TAILTTAMSKNINPTMSSQIQGTTNQHSSTVFPLLLG 

ATEFQDSDQMGRGREHFQSRPPrrWTMIKDVNVKMLSSTTNKLLLESVNSHQTSVREV 

SEPRHNHFYSHTTQILSTSTFPSDPHTAAHSQFPffRNSTVNIPLFRRFGRQRKIGGRGRIIS 

PYRTPVLRRHRYSITOSTTRGSSEKSTTAFSATVLNVTCLSCLPRELTTATAALSFPSAAPI 

TFPKADIARVPSEESTTLVQNPLLLLENKPSVEKTTPTIKYFRTEISQVTPTGAVMTYAPT 

SIPMEKTHKVNASYPRVSSTNEAKRDSVrrSSI^GAITKPPMTnAITRFSRRKIPWQQNFV 

NNHNPKGRLRNQHKVSLQKSTAVMLPKTSPALPQRQSSPFHFRRLSTSVMQIPSNTLTT 

AHFfirTKTHNPGSLPTKKELPFPPLWMLPSnSKDSSTKSnSTQTAIPATTPTFPASVr^ 

TQTERSRAQTIQPvEQEPQKKNRTDPNISPDQSSGFTTPTAMTPPALAFTHSPPENTTGISST 

ISFHSRTLNLTDVffiELAQASTQTLKSTIASETTLSSKSHQSTTTRKASLDTPIPPFLSSSAT 

LMPVPISPPFTQRAVTDTRGDSHFRUvlTNTVV 

HSTPMPALTTVKSQNSKLTPSPWAEQFWHKPYSDIAEKGKKPEVSMLATTGLSEATTLV 

SDWDGQKNTKKSDFDKKPVQEATTSKLLPFDSLSRYIFEKPRrVGGKAASFTIPANSDAF 

LPCEAVGNPLPTIHWTRVSGLDLSRGNQNSRVQVLPNGTLSIQRVEIQDRFQYLCSASNL 

FGTDHLHVTLSVVSYPPRILERRTKEITVHSGSTVELKCRAEGRPSPTVTWILANQTVVSE 

SSQGSRQAVVTVDGTLVLHNLSIYDRGFYKCVASNPGGQDSLLVKIQVIAAPPVILEQRR 

QVIVGTWGESLKLPCTAKGTPQPSVYWVLSDGTEVKPLQFTNSKLFLFSNGTLYIRNLAS 
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SDRFTYECIATSSTGSERRVVMLTMEERVTSPRffiAASQKRTEVNFGDKLLLNCSATGEP 

KPQIMRLPSKAVXTDQGSWfflYPNGSLFIGVTEKDSGVYLCVARNKMGDDLDLMHVSLR 

LKP AKIDHKQ YFRKQ VLHGKDFQ VDCKAS GSPVPEIS WSLPDGTMINN AMQ ADDS GHR 

TRRYRLFNNGTLYFNKVGVAEEGDYTCYAQNTLGKDEMKVHLTVITAAPRIRQSNKTN 

KRIKAGDTAAVLDCEVTGDPKPKIFWLLPSNDMISFSroRYTFHANGSLTmKVKLLDSG 

EYVCVARNPSGDDTKMYKLDVVSKPPLINGLYTNRTVIKATAVRHSKKHFDCRAEGTP 

SPEVMWIMPDNIFLTAPYYGSRITVHKNGTLEIRNVRLSADFICVARNEGGESVLVVQLE 

VI£MU01PTFRNPRNPF^EKIVAQLGKSTALNCVSVDGNPPPEIIWILPNGTRFSNGPQSY 

QYLIASNGSFnSKTTREDAGKYRCAARNKVGYIEKLVILEIGQKPVILTYAPGTVKGISGE 

SLSLHCVSDGIPKPNIKWTMPSGYVVDRPQINGKYE.HDNGTLVIKEATAYDRGNYICKA 

QNSVGHTLITVPVMIVAYPPRITNRPPRSIVTRTGAAFQLHCVALGVPKPEITWEMPDHS 

LLSTASKERTHGSEQLHLQGTLVIQNPQTSDSGIYKCTAKNPLGSDYAATYIQVI (SEQ ID 



NO: 16) 



Figure 19 




Signal Peptide 
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Figure 20 
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Figure 26 
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Figure 27 
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Figure 28 
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Figure 30 
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Figure 31 
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Figure 32 
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Figure 33 



iu 




f J. ' ^ IP* 



61/85 
Figure 34 
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Figure 36 
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Figure 38 




t get; 



Figure 39 



SfrAI 03123) 



aw* 

Ne<40*70> 




64/85 
Figure 40 
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Figure 41 
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Figure 43 
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Figure 45 
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Figure 47 

^(Xb«I) 

TCTAGAA GTAAAATQATCCTGAGTAGCGATCCrGGGAAAATACGTAC^^ 
AATCATCTCTCTGTGGTTTGCnXK3AGCTGAGGTCT0GAA 
ACCTACCCAATACAGAGCTATGAajTTAQTCTGGAAGGAGCITIXKjAAGAAT^ 
CTAGCTXXXCAGAACATACTAGATGCCATAlTrc 
CTTAAGAATATGGTTGTCTCTGCAGTGCTAAGGACCTTGTTCOTGa^ 
GGTTAGTGCrAACTCTGACTOCTTGACTCTTTAATTCTACCTTGAT^ 
ATCACTTGGTGATTAGCAACTGGATATGGAATATTACTAATTTGTACCCAAGCCAGGCCAC 

CrCAGCTTTGGCAGCTCCATTCATTCTGTGGAGC 

ACTGTTTCTACTTACAAGACXKJATTACCTGAGATGAGTCAUlTllCrTCACAAGTCTTTTTA 
GAAGAGTCAATTAGACATATTCTQATGAAGTAAGCATATAAAOTGAGAGCAGCATGAATG 
TGTTCCATGTATGCTCATGGATGCTATTATAATGTGGAAATAAACTG ACTTT AAAAAAAAA 
AGCTTATOATACTTGTCACAGAGTAAATCTTCCATAAATATCATCTGCATTTATAAATTAT 
TTTCATAATCCATCAATTAAAAACCTTTAGAAATTTTGTT^ 

CTGCCCTAGGATGGTCTTGTATQGTGGGCCTGAGAGATGGAGCTTAAGAACTTACTl^ 
AGGAGCACATCTTCAQAACATCTGCCTCAAAACATTTATC^ 
ACTCACATQTGCITCAACCACAGGGATTAAACAGTCATTTTAGTCACATTT^^ 
GGAAGCCTGCTAGAaGAACAGGATGTATCAGGATAACATC^AAOCITACAAAAGGATGTC 
ATAACCCICACCACAACAAACAAC^COACAACAAACCCATAAAaATT ATCAC OGCAAAT 
p gAACTAAGCCATATCCAGaAAAAGTATTATATQ^ 

i n AGTCAAATATGCAGAATATAGACAAAGATGGTTTATGCAAGTGGGGATGGCGAAGGATA 
£ CrrGTAGATTAGACXIACACAAAGCAACAACTACAGAGTGAAGTAATCCAGAGACTTAATG 
!~ TATAATATGACKlACTGTATTTAATAATTCTATTTAAGATACACAGCAAACXJAQTGTATCrr 
I « ACTAACACACACACTTACATAGAGAGAArAAAGTGATAGATA^TTTGTriTATCTTCATG 
\ 0 TAGCTGATAATTTCATATTGTACACXrrCAAACATAGATAACCAACAAAGAGQAAGAGGAT 

AGQTGCCTCTCCCAGGGCGGAAGAGTACATTCGAAAGTCAGACACCATTGTGTAG^ 
U CCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAGGGATCTGCAACCXrrATAGG 
H TGGAACAACATTATGAGCTAACCAGTACXCCGGAOCTCTTGACTCTAGC^ 
^ AAAAGATGGCCTAATCGGCX^TCACrrGGAAAGAGAGGCCC 

!=i ATGCCCCAGTACAGGGGAATACCAGGGCCAAAAAGGGGGAGTGGGTGGGCAGGGGAGTG 
CKKSGTGGGTGGATATGGGGGACTTTIXjGTATAGGATTGGAAATGTAAATGAGTTAAATAC 

15 CTAATAAAAAATGGAAAAAAAAAAAAAAAAAAAAAAAAAGGAAGGTCAGACACCT^ 

'~ 5 TTCACTGCTATCTCAACTTGCAAACAaAAGGGGAOTCACAAACCCAGGACAAAC^ 

!= GATTGAAGCGTCTTTGMTGTTATTGCTGT^^ 

TGTGAAAACTTACXSGGGTCTATGACATGTTTTnTATTC^ 

ATATTrOGCA<XACTACCAGrcCCAGCCCCC^^ 

C^CACACACACACACAC^CACACACACACACACACAGACaCACCTT^ 

catcatctgctcaci^cxx:accc^ 

CTATTmATQTCTAGGTTCCCXXntCCCCTGmGGAGATGaGA 

agaatttgtagcccctgagccagcccgggccacagagcctgccaccagacaggaaaagcc 
cagggcttaccagcacackiaggagcaaactcgcaggcgagcctgggttggc^ 

CXXXKjGTCGATGGCCCGCCCATTCCCAGAAGCCGAGGCTATAGCTGCGT^ 

CCCTCCTCC<X}AGTGAAGACCCCTAGAGGCTGAGCAGACCCCAAAGO^aGTGCAATTCCA 

TTGQCCCAAGGCAGAGGTGAGCGGCIGCTAATCCCCTCGGGAAGTGAAGG^ 

Exonl 

AGTCTGGTAGATGTGGGAG^TGGGGTTCAGGGCGAQACAGAGGGTGX3GATGGG£^ 

GCTt*X^OGAAAAG GAAAGTACTGGA GGGG^AGTTG^K^ACAAAAGCA^CGACCAAG^GAA 

r.ATrrarrTCAQTQACTOAAOOCAftGCAAAACX3AQ^ 

(WrTTTCATAA ACACTGATQACGTOTITgr^^ 

r^ArrtTrraAAAQAAraAGTGATCGATTACTACTTO^ 

TA APTA ArrrTATA^ATTOAGr^AGAAGGAAQAGCATTtX^^^ 

GT-A ATCAG ATAAAGG AA AGCTTTGGTT A GTTTGGAAATGTATG AT ACQ ATT A A A ATAACA 
GAAGCGCCTXX^AGTrCrCTGAAGAfl TCAQTCCCCXZAflCTAGTG AfiCCTACTAA 
fKTTTrrfKrrCCCGTTGGAAGCAAAGAACGTTCCT^ 
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AGATTTCCTGTCTCTGCTTATGTTACAAGAGGATTC 
TATTGCCAACTCTTTTGT^^ 
TTCATTTACXKKrrCTCTTCTCTTA 
CAGCTACCTTGTACXATXrTTTGTTTA^ 

AOQCAGAGAGTATTCAAAAATATTCAGTOATOTAACAGTGACAGTGTAGGCATAGAAGTA 

ATCATTAGTAAATCrTAATTTGGGTTAAACTCATTCATAAC^ 

CACTGAGCCTTCGCCACGTGCGGGTTAAAGATATTTTCrAA 

CTTOGCCATGCTCCCCATCACTOTGTCAGTA^ 

AGCAGACAGTGTTATGCCTGCAAAGTCAGAGACI^GCCCT 

G1XX:TCCCGGTCATTAGTTGGCTCTGAAAAGGC(XATGTGTCCTTA 

AGACATGCTAGAAAGAAATTTGACCnTnTTrCTA 

TTTGGAAGGITAAGCCAAATAAATAAAACACATATTAAATAATACAATQTTA 
GATCATATAAAGAAGTACATTCATAAATGCAATGTC 
TTACTGGTGCAAAGTTTTCTAAATTGCACATGTACTATTT^ 
AAAATGTATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAAATTT 



TTAATTGTCCCGATAACATTATTAT^^^ 
AATGAGAAGCTATGAAGATATGTTTACMTAATAAG<XCATTrAGTC 
GGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTTCC^ 
CTCTAGGGAATAAGAGTTCCTCTTTCT^ 

TTCTTGATGATGAGC^TTATAAGTGGAATAAGATAGGATCTCAAAGGAATGT 

ATGCCCTGAACAAlXTTTTCAGGTCm 

TTGGGGGATGGTGTTAATTTTTTTGCAGTTXnTATGGA^ 

AAACAAACAAAAAACCTCTGAAACTAGAACTACCAATCCATTACT 

AGAAATCIXX^CAGAATTTATTGCTACATTGTTC^ 

ACCAACITACGTAGCCGTCAAAATATGAACG 

AGAGTC^ATGTGG<XATAAAAGAGTGAAATCATGACATATGCAGGAAATGGATGCAACT 

GGAAATCAATTOGGCTAATCAAAACAAGACAGACTCAAAAAGGAAAC^ 

CTCTGACAAACAGAAGCTAGATTTAC^CTTGTACGTGC^ 

TAGTTATACACTATTCTAATCTGTGAGTGTGTATAAAGGCATGCATGTAAAG 

AGCTAGCTGGGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATA 

GTAACATAGGTG<XAGGATGAAATGCATTAATTTGTATGCrAACT 

GGCAC^CGTTCAAACCAGGGTGAAATCCCAGCACAG 

TTCGCCACTAACCAAGAAGCCATTTGCAGTTGCTGCCT^^ 

CTCCAGTCTGACACTGTGTATAACAACCAGTIXiA 

rTTTTGTGCTATTTTTCATTTTTTT^ 

QCTGTGGTTTTCATTTGTTrcTTTTGAGAGAGA 

AGCTGGAAACXUTCTGGAAGAAGTTGGGGAAAGAGAAAAATTGTA 

iCAAACAAACAAACAAACAAAAGGTTCATTTTGCCACAAAA^ 

^AGTTACGACTCTTAAAGAAAATATTCCCAATTAT^ 

7TAGGACTTTGCTTGAACTGGCCCTATAACTC - Artificial 




Nod lite 



(SEQ ID NO: 17) 



I 
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Figure 48 



CACAGACCTTCCTCTTCTAACCTCT 
CUXWAAGCACACTCTGCT^ 
OCAGACCCACACGTTTCCAACTGCrcCTGAGA 
GAAACCTOCTQTOATCTCCATTTCTTCTCT^ 

TTarrcAAACxjiTixxnxn^^ 

TCATTGGGAAOXrroTOCTCCATCC^ 

CTTCWTCAGTGCAGIXnTITQTATCCrc 

AACTGTTTTTrGTTATGATTGCTGCAATCTCTTTC 

TCCTTTGTCTCCTCXrrTrCAGTACCA 

ACCAAAGTGATATAGATTCATGGCATTaTCTAACT^ 

ACITGGCAGGTGTAAAAGCCTGGAAGCAGTCACXJAGGCAGTTTCACAGAAAC^ 

CTGGAACCTTGGCATTCCCATAGCTAGMTGC^ 

GGTCTTGCATGCTTTCTTGCAGTATTTTAC 

TAGCAAAAGTATACCTAGAATCTTCATGACAGTCAGGTATTGCAA^ 

GAAGAAAGTTGGTAAATTCTTCTGACAAA 

AGCTAAGTCACACTCATATGCAAGAATTTACCAAGGCCrAGG 

GCTTTATTCATGATAAGGTCTGCTAGAGCAGAACXXXX^ 

AAAGGTGTAGCATAAATTGTGACTAGAGTGTGAAATCTTTACCTGTC^ 

GGCAGAGCTGTTTTATCITrACTGTAAACATTACCTG 

TCCTCTGTTTTGTGACAGATACrrCT 

TTTCIXjTATTATATAAGCCTGGTGTTCCCITIGTGAAAAA 

ccttgtgtctgtgtccttttgtcatttct 

gtcttttccacagattgagggaggcc^^ 

gctcactgttggtgactacctc^^ 

aagtgtgggtccaggaagcaggttgtacaccc^ 

aaataagttaacccatctcttcctcgtttgctaatat^ 

tgctgctggtgtgtcto^tc^^ 

TGTCCATTGrrTCTT<^^ 

AATCTWK5AATGACTTTICTAAGAAATCK^CACGTG 

TAAACAAGATATATGGTGCCK^GTTTATAATCATAATAA 

CATTCTCATTCTGCITCXACGCTGAGCATCC 

AAAGTAGTGATTTATGACCTGCTrrGTTCTGTTACT^ 

ATGTTGAAACATGGAACTTGGGGTGACCTGTATCTGTO 

TTTGTCTCCAGAATAAATGAGTTTATC^AC^ 

CGCCCXJTGGAGTCCCACCATTCTACTTCCTC 

ATGTGATTCTTCCAACTCCAAATATTTCAAAT^ 

ACTAAGTCTATTGATTTtKlAAGATCTCAGTGGrrAGGATTTG 

TCAGGCTAAGCIXL^AAAATATGACAAATGAAATGTCAAATGTC^ 

AGTGTTAGGGGGTTTTAAAGAAACAAATACXJrACrCT^ 

CTATTCTAGTTTTOAATAA^ 

TTATCTTGACXAATCTCAGTGACT^ 

AGGTCTCATTCAACCACATGTTTA^ 

ATTCTTCCAGAGCACTGGAATTGGGTTO 

TGTGCTAAACAGAGAGTTACAAGACXTTm^ 

TGACAGAATGTCAGAGTOGAGCTCXXACTGGGGGAGGGGCTGKJT^ 

TTGGAACATCACAGAAAGAAATTCCAAATTATGAAATGCACATGATC 

ACTTTTGACTC7TGAACATGAGCTTTTAAAGTAOT 

GTGAAGGAAAGCTCGCCAACnXTTTTTAT^ 

GACAGTATGCCAGTCCCAAGATCTOT^ 

AAGGCAGGAGAAGGCAAGCTACAGTGAAGCCCAGTTCACTATAAAGCTTTA 

AG ATAAG AAGGGTTTCTAATTTTTAAA 1 iilill 1ATTAGATATTTTCTTCATTTACATITC 

AAATGCTATCC^AAAAGTCCXX^ATACTC^CCCCTC 

(XATTTCITGGCCCTGGTTACr^ 

ACTATCTGGTCTATIXKAAGAATGGCT 

AGCCAAACAGTGGAAGGAACTTGGGGACTCTTATGG^ 
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GCCCCGQAT(HKX3AAA(K3AACTCCACA<X»AAGACCAACAT^ 

ACx:crroaa<Knxnx^aAGTCTGAAccA(XMCCATAQ 

axntnrcACTCATATGTMCAGATATOTG^^ 

GAT0OG<K}TTAQ<KK3T0O<K}ATaO0<K}mTCTCAAM 

TXJTTOTACTGAQCTGCCTAGTCTG<j<X^rCA 

GTAAAGACTTGAAGTTCItSAGGTQTOGGTGGAGGGTATACTCA^ 

TA(K>AAOAOAAGAQOAOOG<3OMGACTT0O0<KJAGGGGGCAGTOAGCA 

A7GAATAAGAAAAAAAAAATAAAAATAATTAAAAAAAAAAAAGAATGGCTGCTGAGCCC 

TACreTAAAACCATTGCATXXXCCOCCCXXAATCAn^ 

OATACTAAGAATAGATOTAGGAAACTATTGTTAACIT^ 

GAACCTTTTTTATTOAAGCTTTCACACAGAGCCTTGTTCTT^ 

TAGCTTAATGATGTTX^GTGAATTAAAAATAACAATGAAGAATAAAGACAACTOTATTTT 

AAGQATTCTTCOTATATATITAAAAATCTAAOQTOGTC^CCT^ 

TTTCTATATATGTTTACTCTATCG^ 

CAACATAGTATGTAACTTATAAGAAAGTAAAAC^TTCATGAAATTGTGAAGGTTACTnlXJ 
CTTACCCTCAQAAACACrrCKKJTTTGAATAATTCTTAT^^ 

AGAATOTAATAACCTACTAAGGCAAACATAGAAGTrGAAATrAAAAAGAGTAGACAGGA 

GAAGTAATAAGGCAAATAATGAATATTIXKTrTAAATAGTrCTTAATGTATCATCTAACTA 

GGGTGTGATTCTCX3AGACTTOACTCCATCCAAAATATCCAAAATQACTCT 

TTGAAACAATGTGrrcAAAATAATAAACATTTCarACTTGAAAATTC^ 

TTGCTTTTTATTGCTGTGATAAGGACXJATGACCAAAGCAGCTTAT 

(SEOIDNO.18) 
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Figure 49 
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Figure 50 

TAAGCCTTTTGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCT 

CCTCAGAAGATTTCATGTCTCAGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGA 

AGAGCTCAGGTATAGCCAACTCTTTTGTTAAATACAGTATGAGGCTTAAGTGTACGG 

CAACTCATGTGGTATTCATTTGCGGCTCTCTTCTCTTATAACTAACTCTTAAGGTGCA 

TATAGTCTCTTCTGTTTCCCAGCTACCTTGCACCATCTTTGTTTATCTAATAATAGCA 

AGCTCATCTGCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTA 

ACAGTGACAGTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATATGGGTTAAACT 

CATTCATAACAGCTCCAGGTTGG (SEQ ID NO: 19) 



TTTGGAACCAACCCAGATGCCCCTCAACAGAGAAATGGGCCAGAAAATGTGGTCCA 

TTTATCCAATGGAATACTACTCAACTTATTAAAAACAACGACTTTCATAAAATTTTTA 

GGCAAATGNATGGTCTGNAGGATCTTGAGTGAGGTAACCCAATCACAAAAGAACAC 

TCATGGTATGCACTCACTGATAAGTGGCTATTTGTCTATGGAGTGATTTAAAAGGGA 

AGAAGACACATAGCTTTTTGTGTGTATAATATTAAGATGGAAATTTGCCAGTGCTGT 

TTGGCTTATGAGTGAATCTTGTTTCAGTGGATTACCGGAAGAAAATAATAAGTGAAC 

TGTAGGAAGAAGTAGTTAATCAAGGTGACAAAGTATCCTGACACATTGGGAAAAGA 

CCACAGTCCAGGAAACTGAGTCTTAAGGATTCATATTAACTCCAGTTCCCCATGTGC 

AGCTCTGAGACTTTGGCAGATCAGACACTTAACTTCACCAGCTTCCTACACAGAGCA 

GTTACTATCCTTGCCTTCACACATGGAGTGTGCCATTAAGTGCCTGAACATGAGTCT 

GACTTGTTAATAATCTTTAAAATCCAATTGTGTGTAAAGTATGTGACCAAAGAGCAT 

GGTCATGCTATTAACCTTTGATGTTCTATGGACTCTTAATTTTATGGTAGAAATGTCA 

ACAAGCITGTGGAGGCTGGAAGATACAAGGCTTAAGAGGATGGCCTTTCAGTTTTG 

AAAGTAATTCAGTATGTGTTCTGGCATCCCTTTTCCTAAAGCAATTTAACCCCCCAA 

GTAGGCATAATTTTAATGCTTACTTCATCAGAATATATCTAATTGACTCTTCTAAAAA 

GACTTTGGTATGCATAGGATCTAAATGTAAATGTGATTTACTGACATAATAAATAGG 

AGAAACTGAGCTAGAATAGGTATAAAATATGTGCTGGCTTTCTAATAGGTCTTATAG 

GTTATATAAGAGGTGGGAAAGGAATATTTGAAACATCTAGAAGTAAAATGATCCTG 

AGTAGCGATCCTGGGAAAATACGTACTCTAACACACTGCAATCATCTCTCTGTGGTT 

TGCTGGAGCTGAGGTCTGGAAGGCTCGACCTTGGTTAGAAATAACCTACCGAATAC 

AGAGCTATGACGTTAGTCTGGAAGGAGCTTTGGAAGAATGACAAGCTGTAGCTGCC 

CAGAACATACTAGATGCCATATTTCCAAGGCAAGTGTCCACATGCGGACATCTTAAG 

AATATGGTTGTCTCTGCAGTGCTAAGGACCTTGTTCGTGCCACACAGGTCTCCAGGG 

TTAGTGCTAACTCTGACTGCTTGACTCTTTAATTCTCCCTTGATCATTAATGACTAGA 

AATCACTTGGTGATTAGCAACTGGATATGGAATATTACTTAATTTGTACCCAAGCCA 

GGCCACCTCAGCTTTGGCAGCTCCATTCATTCTGTGGAGCCCAGTCACGTGGGTTTG 

AATCAACTGTACTGTTTCTACTTACAAGACGCATTACCTGAGATGAGTCATTTTTCTT 

CACAAGTCTTTTTAGAAGAGTCAATTAGACATATTCTGATGAAGTAAGCATATAAAG 
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TGAGAGCAGCATGAATGTGTTCCATGTATGCTCATGGATGCTATTATAATGTGGAAA 
TAAACTGACTTTAAAAAAAAAAGCTTATGATACTTGTCACAGAGTAAATCTTCCATA 

aatatcatctck:atttataaattattttcataatccatcaattaaaaacctttagaaa 

TTTTGTTAACACAAAGATCCCTAGGCCCCTGCCCTAGGATGGTCTGTATGGTGGGCC 

TGAGAGATGGAGCTTAAGAACTTACTTGCTCCAGGAGCACATCTTCAGAACATCTGC 

CTCAAAACATTTATCCCAAATGCTCATCAAAGGCTCACTCACATGTGCTTCAACCAC 

AGGGATTAAACAGTCATTTTAGTCACATTTCTCAAACGGTGGAAGCCTGCTAGAGGA 

ACAGGATGTATCAGGATAACATCCAACCTTACAAAAGGATGTCATAACCCTCACCA 

CAACAAACAACAACGACAACAAACCCATAAAAATTATCACGGCAAATGAACTAAGC 

CATATGCAGAAAAAGTATTATATGTTCTCATTGTGGGGTGTTTTTCCTTAATAGTCAA 

ATATGCAGAATATAGACAAAGATGGTTTATGCAAGTGGGGATGGCGAAGGATACTT 

GTAGATTAGAGGACACAAAGCAACAACTACAGAGTGAAGTAATCGAGAGACTTAAT 

GTATAATATGAGGACTGTATTTAATAATTCTATTTAAGATACACAGCAAACGAGTGT 

ATCTTACTAACACACACACTTACATAGAGAGAATAAAGTGATAGATACGTTTGTTTT 

ATCTTCATGTAGCTGATAATTTCATATTGTACACCTCAAACATAGATAACCAACAAA 

GAGGAAGAGGATAGGTGCCTCTCCCAGGGCGGAAGAGTACATTCGAAAGTCAGACA 

CCATTGTGTAGATGTACCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAG 

GGATCTGCAACCCTATAGGTGGAACAACATTATGAGCTAACCAGTACCCCGGAGCT 

CTTGACTCTAGCTGCATATATATCAAAAGATGGCCTAATCGGCCATCACTGGAAAGA 

GAGGCCATTGGACTTGCAAACTTTATATGCCCCAGTACAGGGGAATACCAGGGCCA 

AAAAGGGGGAGTGGGTGGGCAGGGGAGTGGGGGTGGGTGGATATGGGGGACTTTT 

GGTATAGCATTGGAAATGTAAATGAGTTAAATACCTAATAAAAAATGGAAAAAAAA 

AAAAAAAAAAAAAAAAAGGAAGGTCAGACACCTCACTTCACTGCTATCTCAACTTG 

CAAACAGAAGGGGAGTCACAAACCCAGGACAAACCACAGTGATTGAAGCGTCTTTG 

AATGTTATTGCTGTTGTTGTTACCACCATCATTAGCATATATTCATTGTGAAAACTTA 

CGGGGTCTATGACATGTTTTTTTATTCAAGTATATCACATGCTGTCAGCATATTTGGC 

ACCACTACCAGCCCCAGCCCCCTTTGCCCCGCCCCCAACACACACACACACACACAC 

ACACACACACACACACACACACACACACACACACACACCTTTACCTTCTCCTGGGCA 

TCATCTGCTCACTCACCCACCCAAGCTTAATCCTTTTCCTTCCCTGCAATAGTACCTC 

TCCTATTTTTATGTCTAGGTTCCCCCTCCCCCTGTTAGGAGATGGGAGAGGTCACGA 

AAGAAAGGAATTTGTAGCCCTTGAGCCAGCCCGGGCCACAGAGCCTGCCACCAGAC 

AGGAAAAGCCCAGGGCTTACCAGCACAGGAGGAGCAAACTCGCAGGCGAGCCTGG 

GTTGGCGCTGGTGGTCCCGGGTCGATGGCCCGCCCATTCCCAGAAGCCGAGGCTATA 

GCTGCGTCACCTGCCCCGCCCTCCTCCCGAGTGAAGACCCCTAGAGGCTGAGCAGAC 

CCCAAAGGCGGTGCAATTCCATTGGCCCAAGGCAGAGGTGAGCGGCTGCTAATCCC 

CTCGGGAAGTGAAGGGACCCAGAGAGTCTGGTAGATGTGGGAGCTGGGGTTCAGGG 

CGAGACAGAGGGTGGGATGGGCAGAAGGGTCCAGGAAAGGAAAGTACTGGAGGGG 

AGTTGGGACAAAAGCAGCGACCAAGGGAACATCGCTTCAGTGACTGAAGCCAGGCA 

AAAGGAGCGGGAAGGATTATATGTAGCCTGGGACGCTTTCATAAACACTGATGACG 

TGTTTGTGCAAAGCAAGCAATTTGAGGAGAAACGCCTGGGACGTCGGAAAGAAGGA 
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GTGATCGATTAGTACTTGTAAGTTTAGGTGAGTTTGAGAACTAACTAACCTATACTA 
TTGAGGGAGAAGGAAGAGCATTCCAGCAGCAGCAGCAGCAGCAGCAATCAGATAA 
AGGAAAGCTTTGGTTAGTTTGGAAATGTATGATACCATTAAAATAACAGAAGCGCCT 
CCAGTTCTCTGAAGAGTCAGTCCCCCAGCTAGTGAAGACTAAGCCTACTAAGCCTTT 
TGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCTCCTCAGAAG 
ATTTCCTGTCTCTGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGAAGAGCTCAG 
GTATTGCCAACTCTITrGTTAAATACAGTTTGAGKjCTTAAGTGTACGGGAACTCATG 
TGGTATTCATTTACGGCTCTCTTCTCTTATAACTAACTCTTAAGGTGCATATAGTCTC 
TTCTGTTTCCCAGCTACCTTGTACCATCTTTGTTTATCTAATAATAGCAAGCTCATCT 
GCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTAACAGTGACA 
GTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATTTGGGTTAAACTCATTCATAA 
CAGCTCCAGGTTGGGAGGGATCACTGAGCCTTCGCCACGTGCGGGTTAAAGATATTT 
TCTAACAAGAGAAGCAGAATTCTTCCTTGGCCATGCTCCCCATCACTGTGTCAGTAA 
GCAGAGGGGTGTTTCCAAGCAGAGAAAGAGCAGACAGTGTTATGCCTGCAAAGTCA 
n GAGACTCAGCCCTCCCAGCTGGTCAGTTTACTGTCCTCCCGGTCATTAGTTGGCTCTG 
!5 AAAAGGCCCATGTGTCCTTATTGGCAAGGACTTGCAGACATGCTAGAAAGAAATTT 
*0 GACCTTTTTTTCTAGTGGGTTATTACAGCTGTAAAAGTATTTTGGAAGGTTAAGCCA 
□ AATAAATAAAACACATATTAAATAATACAATGTTACAAAAATTGATCATATAAAGA 
tH AGTACATTCATAAATGCAATGTGAAAAATATATATAATTTTTATCTATTTACTGGTGC 
AAAGTTTTCTAAATTGCACATGTACTATTTTTATATTTATAAAAATATTTTTAAAATG 
■ U TATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAAATTTACAAACTTTA 
•° ATTGTCCCGATAACATTATTATGATCTCTAATGACAGGGATCCTGCTTTTCATTGGGA 
L AATGAGAAGCTATGAAGATATGTTTACAATAATAAGCCCATTTAGTGATAAAGTCCA 
'J ATGGGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTTCCTGAGTCTATGCAA 
>2 GTTTACACTCTAGGGAATAAGAGTTCCTCTTTCTCCAGATTTCACTAGCATTTGTTGT 
1 3 1 C ATCATTTATCTTCTTGATGATGAGCATTATAAGTGGA ATAAGATAGGATCTC AAAG 
3 GAATGTCAATTTGGATGCCCTGAACAATCTTTCAGGTCTTTCTTTCAGTTCACTAGTC 
P TATTCATTTATTGGATAATTGGGGGGATGGTGGTAATTTTTTTGCAGTTCTTATGGAA 
TTCCAAAAAACAAAAAACAAACCAACCAACCAAAAACCTCTGAAACTAGAACTACC 
AATCCATTACTGGGTATGTAACAAAGAGAAATCTGCACAGAATTTATTGCTACATTG 
TTCATTATTCACGACAGCCAAGAATGTGGAACCAACTTACGTAGCCGTCAAAATATG 
AACGGATAAAGAAAATGTGGAAATGTGTACAACAGAGTCCCATGTGGCCATAAAAG 
AGTGAAATCATGACATATGCAGGAAATGGATGCAACTGGAAATCAATTGGGCTAAT 
CAAAACAAGACAGACTCAAAAAGGAAACACCGTGTAGCTTCTCTGACAAACAGAAG 
CTAGATTTACACTTGTACGTGCGCATGTGTGTTTAGAATTTTATTTAGTTATACACTA 
TTCTAATCTGTGAGTGTGTATAAAGGCATGCATGTAAAGCAAAAACAAGCTAGCTG 
GGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATAGTAAC 
ATAGGTGCCAGGATGAAATGCATTAATTTGTATGCTAACTAAACCACAGACAGGAG 
GCACACGTTCAAACCAGGGTGAAATCCCAGCACAGAGAAGGGGAAGTAGACACAA 
AGTTTCGCCACTAACCAAGAAGCCATTTGCAGTTGCTGCCTGCTGGGAAGGGGCGTT 
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CCAGTTTTCTCCAGTCTGACACTGTGTATAACAACCAGTTGACAATACAAAGTTGGC 
ATGATGGATGGTTTTTGTGCTATTTTTCATTTTTm 

GGTTGTTGTGGTGGTGGCTGTGGTTTTCATTTGTTTCTTTTGAGAGAGAGAAGGAAC 

ATGAAATTGGGTGGGTAGGAAGCTGGAAACGATCTGGAAGAAGTTGGGGAAAGAG 

AAAAATTGTATGGAGCATATTTAAACAAACAAACAAACAAACAAAAGGTTCATTTT 

GCCACAAAAAGGTGTGAATTAAATTAACCAGTTACGACTCTTAAAGAAAATATTCCC 

AATTATTCCCAGAGTTGCTATGTATGCTGTGCCTAGGACTTTGCTTGAACTGGCCCTA 

TAACTCTGGTGTGGTGTCTTTTCAGGATGCAGAAGAGAGGCAGGGAAGTCAGCTGCT 

TGCTGATCTCCCTCACTGCCATCTGCCTGGTGGTCACCCCTGGGAGCAGGGTCTGTC 

CTCGCCGATGTGCCTGCTATGTGCCCACAGAGGTGCACTGTACATTTCGGGACCTGA 

CCTCCATCCCAGACGGGCATCCCAGCCAATGTGGAACGAGTCAATTTAGGGTGTGTG 

GACCTTGCCTGATCTCCTrCTCAGAGAGGGACCACTGATTTTCCTGGTACTTTGCCCC 

CCAAACACCTGTGATTACTTTTAATAGTTTTCTTCTAAAATGGGTTCATACAAACCTT 

ATATTGTGGAGACAATGAACATTTTATCCCAATAGTCTTTTACTAGAACTTGAAGCC 

CCTCTTAGTTGTTTGGGAGCCTCATAATTATGGGGCAGCTTTATTCTGAATGAATTTT 

AAATGAAAAAGATACAGTTTCTGTTAACAATCATTATGATACCAAGGAAGAGGAAT 

TGTCATTGAATATTTTAAAAAAGCATTTCTTTTGCAATTTATAAATACCCATTACAAA 

ATGGCTTACTTAAAATACTTGCCTTACTAAATCTGACAAATTATGGTGATATTTTGAA 

GGTTTATGAAAATTTGTTTATGTGTATAAATGCACAAGAAATGGGATATGCCATCAC 

CTATGTGCCATTAGTGAGCATGTACAGTATGCCAAACACTATTGTTCACGTTTGGAG 

GAAGTAATGGGGGTGGGGGAGCAACAAGGGTTATAACCGTATACCCAGTGCCTTGG 

AAGCGATTGCAAACAGTAAAGACTGACATTGTGTTCTCCCTATGAGGGAGGGGCCTT 

GGGCTGAGCACTTTGCAATGAGCATTTGCTCATTGTGCTGGCAGGTTTTATGATAAC 

TTGACCCAAGCTAGAGTCACTGGAGAGGAAGGAACTTCAACTGAGAACATGCCTGA 

AGAAGATCAGATTATAGGCAGGCCTGTGGGGCATTTTCTTAATTAGTGATTCATGGG 

GCAGGGCCCAGTCCATTGTTCGTGGTACCATTTCTCAGGCACTATTAAAAAAAAAAA 

AACAGGCTGAGCAAGTGTCAAGGAGCAAGTCAGTGAGCAGCAGCCCTAATGATCTC 

TGCATCAGCTCCTGCCTCCAGGTTCCTACCCTATTTGAGTTCCTGTCCTAGCTCCCTA 

CAGTGATGAACAATGATGTGGAAGTATAAGCCAAATAAATCCTTTCTTCCCCAACTT 

GCTGTTGGTCATGATGTTTCATCACAGTGATAATAGTCCTCATGAAGATGCTGGTGT 

TTATAACACCTTTGGACTAAATTCTGTTATCTATAGCTGAGGAAAATGGAGCATAGA 

AAGTCTCCAGACTACACCAGAGTGTAATCTGGGCCTGAGCTTAGAATCACACCCAC 

GTGCACTCCACTGCCGGGGCTTCTTAACCGGAACACAGTTGTAAAAGGGAATTTTCT 

GTTTGTTTCCATTTTGACATGTGGACTTTAATTGACGATTCATCTGAAGCTGAAAATG 

ATTTTTTTTCCAGGTATAACAGCCTCACTAGATTGACAGAAAATGACTTTTCTGGCCT 

GAGCAGACTGGAGTTACTCATGCTGCACAGCAATGGCATTCACAGAGTCAGTGACA 

AGACCTTCTCGGGCTTGCAGTCCTTGCAGGTGAGATAGGTAGAGGGTGATGGAGGC 

TGAGAAGAGAGGTGCAACTGTGGGTTATACCCAAAAGCTGCTGATTCCCGTGGGAG 

ACATTCTATAAGCATTCTATAAACTAGAGGCAGATATCAAGGAAGGATTTCAATTGT 

AATGCAATTTTATGAGAAAATTTGAATATTAAGAAAATGCTGGGGAAAATGCTTAC 
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ACAATTGCGAGGACCTAATTTAGGATCTCCAATAGCCACATAAAAAGCACAGCATG 

GCGGCAGACACCTGCAATTCCTGTCCCTGGAAGCACCTGTTCAGAATCCCAGAGACT 

CATTGGCCAAACACTCTATTCAATCAATGAAGTCCATATTCAGTGACAAAACTTGAC 

TCAGAAACTAATGTGGAAAGCATCAGGAAGACAGCCAACATCTGGTCTCTACTCAT 

GCATGAATAAGGGATCCCAGAGAGAAGGGAAGAAAAAGGAAGGAAGGAAGGAAG 

GAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGAGAGGGAGGAAAGGAGGG 

AGGGAAGGAAGGAAAGGGAAAGGAAAAAAGAGATGGGGAGGGAGGGAAGGAAAG 

GAAAGGGGGAGAAAGAAGAGAAGAAAGGAAAATAAATAAATTTTCAGGGATTATT 

ACACCTTTAAATTTTATCCATAAAAGGTCATTTCCACCTGTTTGTCTGGAAGTAGAGT 

GGGATCCCTTATATAAGGGCAGTCTTTAACATAGTAGCATTTTATAAACCATTACAA 

ATTTTGAGTTTTCTCTACTTTTTATCCTCTACCATCTTCAAACTGA 

CCCACAAATGAAGAAAATGCTGTAAGAGTTTTCACACACCGAAGTGGGAAACTTAA 

GGATTAGACAAGTCTAACAATGAGAATGGGGAGAACAAAAAGAGACTGCACAGGG 

AGCCCTTTCTCTGCTTATAATCTTGACACTTGAGAAGCTAATTGACGCTGCATGACTA 

CTCAACTCTTTAAGCAAACAATGCTGTTGTTCATGAAAAGCACAATAAAGTACATAT 

GTCCCATAATATTCATCAAAATTTGCATGCAGCACATAATAGCAATCAAAGCAATAA 

CACCCACTGTTCACAGAGACTTTAAACATGAAACTGGAACTATGTCTAGTGTTTTGA 

CTTAGGGTACATAGTATGCTGTGTCTGTATGTACCAATGTTGATTTAGGTCATCAGA 

CAGCATTTGGAACATGTATCTTCAGGAGGAATCATTCATGTATCCTGCATGAAATTC 

TCCACCTATGTTTATTCTCTTAGCCAGGTTTTTCTCTGATGGAGAAACATTGGGTTTG 

AGGTTTTACTCCCAGGTAACATTTAGGGAAAAGCTGTCTATGTTCTCAGTTTGGCTTT 

TATTTATGAGGGATGTTGGTATTCCAGAAAATTCTCTTTTGAAGAGATTACAATTTA 

GGTCAAAACAGAAAAATATGTAAAAAGTTATTGTTTTTATTAGTATTTCATGTTCTTT 

AAATAATCAGAGAGGGATTTGATGAATGCTAAAGCATCATGAAAAATTCAAAATTT 

TTTGCTTCTAATTCAGAATCAATTAAATTCATATTACTATAAAAGACAGCACGCCAG 

ATGTGTGCCAGCTGAGGAGTGGATAAACTGTGTAACGTGAGTGCTATGTAGAAACA 

GAAAGGAGTGAAGGGTTGATGTGCGCTGCAACATCTTGAAAACATTCGGCTACATG 

ATGGAAGCCAGGCACAAAAAGCCACATATTGCATGGTTATGTTTATATGAAATGTTT 

AAAATACATGGATTCTTAGCAAACAGAGTAAGATGTTACTTAGGGTCAGGAAAAGA 

TTAAAAAAAAAAAAACTATTGATGTGGAATGATCTTAATTTGGGGAAAAGACAATT 

TCCTAAGACGAAATAGTTGAGGTAGATATAGTTATATCCCTGTGGATATTGTAATAA 

ACCAGCATGCTGTGCTCTGAGAAGGGCCTAATGAAGGGGCAGGAGGAAGTGAAATG 

AGATGGTAGAAAGGAAAGTCATATACCATGGCTTCTCTCGTGGGTGGAATCTAGAT 

ATGTTAATATATTGACATAAAGGAAGGAATTGTTTAGGGAAGGATCAAAACCAACA 

GGAGTGAGGGAGACAATAGGAACCAATGAGAGGCAAAGTTCATGGTCAATGTGTGT 

GGAGACACCATAATAAAACTCCTTTTTTGTTTGCTAACTAAAACCACTAAAATCTAA 

AAACAAAACATTTTTGCACAAGAATTATTTATTATTCAATAAAGATGTTTAAATGGG 

GGAAGTTGAAGTTCATTGATAGTCTCATAAATCTTAAATGTATTTAAACTGCTTTTTA 

CGTTTTTTATTATTAATTACTCTTGCTGTCATTATTATCATCATCATTATCGTCATCAT 
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CATCACTAATGCTTTTCACCATACACAAATGTAGGICAGAAGAGTGTAATCCACTTAG 

TGAGGCAATCTTGGAGAGGGAAAGGAAGCGGATGCGGGGCAGAGGCACACAGGAG 

GACAGTGAGAGGGAAATGAACAAGAAAAAATGTGGACACATGCACAAAAATTCCA 

TAGTCCACTACATTACTTTGTATTCTAATATTAAGAAAATAATAAACCCATTTCTGTG 

CACTTATCACCCAGGCTCAACAGTTATCTTGGCCACAGATCCTGTCTCACTGCATCCT 

GTCCACCTGAGTCCACTTAGCGTTCTGAATCCAATCCAGGGCATGATGCTTACTCCT 

ACACAGAACTAAAGATTAAAGAGAGTTTAAAAGTAACCATGACATCTCTCTGTTCCT 

TTAGCGATAAGTTCTTAATATTTATGGCTGCTTGTGTATGTTCTAATTTCTCTAATATT 

GTCACATTTAGTTGGCAACTACTTTGTTTGAATTGAGTTGGAGTTAAGGTCCCATAG 

GATTAATCTCAACATATTTCTATATTTATAAACTTTTCTCTCTTTGTGAAAGTTCCTTT 

GAGAAAACAAATATGCCCATATCTTTCTTTACAGGTCTTAAAAATGAGCTATAACAA 

AGTCCAAATAATTGAGAAGGATACTTTGTATGGACTCAGGAGCTTGACCCGGTTGCA 

CCTGGATCACAACAACATTGAGTTTATCAACCCCGAGGCGTTTTACGGACTCACCTT 

GCTCCGCTTGGTACATCTAGAAGGAAACCGGCTGACAAAGCTCCATCCAGACACAT 

TTGTCTCTTTGAGCTATCTCCAGATATTTAAAACCTCCTTCATTAAGNACCTGTACTT 

GTATGATAACTTCATTGACCTCCCTCCCAAAAGAAATGGTCTCCTCTATGCCAAACC 

TAGAAAGCCTTTACTTGCATGGAAACCCATGGACCTGTGACTGCCATTTAAAGTGGT 

TGTCCGAGTGGATGCAGGGAAACCCAGGTAACTATCTTGTTTGTTTGTITCTTTTTTT 

ATARKACGTATTTTCCTCAATTTCATTTAGAATGATATCCCAAAAGTCCCCCATAACC 

TCCCCCCCACTTCCCTACCTACCCATTCCCATTTTTTGGCCCTGGCATTCCCCTGTACT 

GGGGCATATAAAGTTTGCGTGTCCAATGGACCTCTCTTTCCAGTGATGGCCAACTAG 

GCCATCTTTTGATACATATGCAGCTAGAGTCAAGAGCTCTGGGGTACTGGTTAGTTC 

ATAATGTTGTTGCACCTACAGGGTTGAA (SEQ ID NO:20) 
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Figure 51 

MPKRAHWGALSVVLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIARHVE 

RINLGFNSIQAI^ETSFAGLTKLELLMfflGNEIPSIPDGALRDLSSLQWKFSYNKLRVITG 

QTLQGI^NLMRLHIDHNKIEFIHPQAFNGLTSLRLLHLEGNLLHQLHPSTFSTFTFLDYFR 

LSTIRHLYLAENMVRTLPASMLRNMPLLENLYLQGM'WTCDCEMRWFLEWDAKSRGIL 

KCKKDKAYEGGQLCAMCFSPKKLYKHEIHKLKDMTCLKPSIESPLRQNRSRSIEEEQEQ 

EEDGGSQLII^KFQLPQWSISLNMTDEHGNMVNLVCDIKKPMDVYKIHLNQTDPPDIDIN 

ATVALDFECPMTRENYEKLWKLIAYYSEVPVKLHRELMLSKDPRVSYQYRQDADEEAL 

YYTGVRAQILAEPEWVMQPSIDIQLNRRQSTAKKVLLSYYTQYSQTISTKDTRQARGRS 

WVMffiPSGAVQRDQTVLEGGPCQLSCNVKASESPSIFWVLPDGSILKAPMDDPDSKFSIL 

SSGWLRIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQSPSTQPAEKDTVTIGKNPGES 

VTLPCNALAIPEAHLSWILPNRRIINDLANTSHVYMLPNGTLSIPKVQVSDSGYYRCVAV 

NQQGADHFTVGITVTKKGSGLPSKRGRRPGAKALSRVREDIVEDEGGSGMGDEENTSR 

RLLHPKDQEVFLKTKDDAINGDKKAKKGRRKLKLWKHSEKEPETNVAEGRRVFESRRR 

INMANKQINPERWADILAKVRGKNLPKGTEVPPLIKTTSPPSLSLEVTPPFPAVSPPSASP 

VQTVTSAEESSADVPLLGEEEHVLGTISSASMGLEHNHNGVILVEPEVTSTPLEEVVDDL 

SEKTEEITSTEGDLKGTAAPTLISEPYEPSPTLHTLDTVYEKPTHEETATEGWSAADVGSS 

PEPTSSEYEPPLDAVSLAESEPMQYFDPDLETKSQPDEDKMKEDTFAHLTPTPTIWVNDS 

STSQLFEDSTIGEPGVPGQSHLQGLTDNIHLVKSSLSTQDTLLIKKGMKEMSQTLQGGNM 

LEGDPTHSRSSESEGQESKSrTLPDSTLGMSSMSPVKKPAETTVGTIiDKDTTTVTTTPR 

QKVAPSSTMSTFEPSRRRPNGRRRLRPNKF^HRHKQTPPTTFAPSETFSTQPTQAPDIKISS 

QVESSLVPTAWVDNTVNTPKQLEMEKNAEPTSKGTPRRKHGKRPNKHRYTPSTVSSRA 

SGSKPSPSPENKHRNWTPSSETILLPRTVSLKTEGPYDSIJDYMTTTRKIYSSYPKVQETLP 

VTYKPTSDGKEIKDDVATNVDKHKSDILVTGESITNAIPTSRSLVSTMGEFKEESSPVGFP 

GTPTWNPSRTAQPGRLQTDIPVTTSGENLTDPPLLKELEDVDFTSEFLSSLTVSTPFHQEE 

AGSSTTI^SIKVEVASSQAETTTLDQDHLETTVAILLSETRPQNHTPTAARMKEPASSSPS 

TILMSLGQTTTTKPALPSPRISQASRDSKENVFLNYVGNPETEATPVNNEGTQHMSGPNE 

LSTPSSDRDAFNLSTKLELEKQVFGSRSLPRGPDSQRQDGRVHASHQLTRVPAKPILPTA 

TVRLPEMSTQSASRYFVTSQSPRHWTNKPEITTYPSGALPENKQFTTPRLSSTTIPLPLHM 

SKPSIPSKFTDRRTDQFNGYSKVFGNNNIPEARNPVGKPPSPRIPHYSNGRLPFFTNKTLSF 

PQLGVTRRPQIPTSPAPVMRERKVIPGSYNRfflSHSTFHLDFGPPAPPLLHTPQTTGSPSTN 

LQNIPMVSSTQSSISFITSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSPQTVSV 

TAETDTVFPCEATGKPKPFVTWTKVSTGALMTPNTRIQRFEVLKNGTLVIRKVQVQDRG 

QYMCTASNLHGLDRMVVLLSVTVQQPQILASHYQDVTVYLGDTIAMECLAKGTPAPQI 

SWIFPDRRVWQTVSPVESRITLHENRTLSIKEASFSDRGVYKCVASNAAGADSLAIRLHV 

AALPPVfflQEKLENISLPPGLSIHfflCTAKAAPLPSVRWVLGDGTQIRPSQFLHGNLFVFP 

NGTLYIRNLAPKDSGRYECVAANLVGSARRTVQLNVQRAAANARITGTSPRRTDVRYG 

GTLKLDCSASGDPWPRILWRLPSKRMIDALFSFDSRIKVFANGTLVVKSVTDK 
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DAGDYLCVARNKVGDDYVVLKVDVVMKPAKIEHKEENDHKVFYGGDLKVDCVATGL 

PNPEISWS1PDGSLVNSFMQSDDSGGRTKRYVVFNNGTLYFNEVGMREEGDYTCFAEN 

QVGKDEMRVRVKVVTAPATIRNKTYLAVQVPYGDVVTVACEAKGEPMPKVTWLSPTN 

KVIPTSSEKYQIYQDGTLLIQKAQRSDSGNYTCLVRNSAGEDRKTVWIHVNVQPPKING 

NPNPITTVREIAAGGSRKLIDCKAEGIPTPRVLWAFPEGVVLPAPYYGNRITVHGNGSLDI 

RSLRKSDSVQLVCMARNEGGEARLIVQLTVLEPMEKPMn)PISEKITAMAGHTISLNCS 

AAGTPTPSLVWVIPNGTDLQSGQQLQRFYHKADGMLfflSGLSSVDAGAYRCVARNAA 

GHIERLVSLKVGLKPEANKQYHNLVSIINGETLKLPCTPPGAGQGRFSWTLPNGMHLEG 

PQTLGRVSLIX)NGTLTVREASVFDRGTYVCRMETEYGPSVTSIPVIVIAYPPRITSEPTPVI 

YTRPGNTVKLNCMAMGIPKADITWELPDKSHLKAGVQARLYGNRFLHPQGSLTIQHAT 

QRDAGFYKCMAKNELGSDSKTTYIHVF (SEQ ID NO: 21) 
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Figure 52 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT 
TTGGGGCCAT CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT 
ACGTCCCCAG CGAGGTCCAC TGCACGTTCC GATCCCTGGC TTCCGTGCCC 
GCTGGCATTG CTAGACACGT GGAAAGAATC AATTTGGGGT TTAATAGCAT 
ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG TTGGAGCTAC 
TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT 
GATCACAGGA CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA 
TTGACCACAA CAAGATCGAG TTTATCCACC CTCAAGCTTT CAACGGCTTA 
ACGTCTCTGA GGCTACTCCA TTTGGAAGGA AATCTCCTCC ACCAGCTGCA 
CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC AGACTCTCCA 
CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA 
TCCGTGGACC TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA 
AATCCAGAGG AATTCTGAAG TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT 
CAGTTGTGTG CAATGTGCTT CAGTCCAAAG AAGTTGTACA AACATGAGAT 
ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA GAGTCCCCTC 
TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG 
CATCTCTTTG AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT 
GTGACATCAA GAAACCAATG GATGTGTACA AGATTCACTT GAACCAAACG 
GATCCTCCAG ATATTGACAT AAATGCAACA GTTGCCTTGG ACTTTGAGTG 
TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG ATAGCATACT 
ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA 
CTACACAGGT GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC 
AGCCATCCAT AGATATCCAG CTGAACCGAC GTCAGAGTAC GGCCAAGAAG 
GTGCTACTTT CCTACTACAC CCAGTATTCT CAAACAATAT CCACCAAAGA 
TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG CCTAGTGGAG 
CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA 
TGGCTCCATC CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA. 
TTCTCAGCAG TGGCTGGCTG AGGATCAAGT CCATGGAGCC ATCTGACTCA 
GGCTTGTACC AGTGCATTGC TCAAGTGAGG GATGAAATGG ACCGCATGGT 
ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC GAGAAAGACA 
CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG 
GATAATTAAT GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG 
GAACTCTTTC CATCCCAAAG GTCCAAGTCA GTGATAGTGG 
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TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC CATTTTACGG 
TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA 
GGATGAAGGG GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC 
TTCTGCATCC AAAGGACCAA GAGGTGTTCC TCAAAACAAA GGATGATGCC 
ATCAATGGAG ACAAGAAAGC CAAGAAAGGG AGAAGAAAGC TGAAACTCTG 
GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA GGTCGCAGAG 
TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA 
GGGCACAGAA GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA 
GCCTAGAAGT CACACCACCT TTTCCTGCTG TTTCTCCCCC CTCAGCATCT 
CCTGTGCAGA CAGTAACCAG TGCTGAAGAA TCCTCAGCAG ATGTACCTCT 
ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA GCCAGCATGG 
GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 
AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA 
GATAACTTCC ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA 
TATCTGAGCC TTATGAACCA TCTCCTACTC TGCACACATT AGACACAGTC 
TATGAAAAGC CCACCCATGA AGAGACGGCA ACAGAGGGTT GGTCTGCAGC 
AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT GAGCCTCCAT 
TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 
GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC 
CTTTGCACAC CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA 
CATCACAGTT ATTTGAGGAT TCTACTATAG GGGAACCAGG TGTCCCAGGC 
CAATCACATC TACAAGGACT GACAGACAAC ATCCACCTTG TGAAAAGTAG 
TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG AAAGAGATGT 
CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA 
CTCCACACTG GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG 
AAACCACAGT TGGTACCCTC CTAGACAAAG ACACCACAAC AGTAACAACA 
ACACCAAGGC AAAAAGTTGC TCCGTCATCC ACCATGAGCA CTCACCCTTC 
TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC AAATTCCGCC 
ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA 
GAGTTCTCTG GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA 
AACAGTTGGA AATGGAGAAG AATGCAGAAC CCACATCCAA GGGAACACCA 
CGGAGAAAAC ACGGGAAGAG GCCAAACAAA CATCGATATA CCCCTTCTAC 
AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT CCAGAAAATA 
AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC 
AACCACCAGA AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC 
CAGTCACATA 
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TAAACCCACA TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG 
TTGACAAACA TAAAAGTGAC ATTTTAGTCA CTGGTGAATC AATTACTAAT 
GCCATACCAA CTTCTCGCTC CTTGGTCTCC ACTATGGGAG AATTTAAGGA 
AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG AATCCCTCAA 
GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA 
TTTCACTTCC GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC 
AGGAAGAAGC TGGTTCTTCC ACAACTCTCT CAAGCATAAA AGTGGAGGTG 
GCTTCAAGTC AGGCAGAAAC CACCACCCTT GATCAAGATC ATCTTGAAAC 
CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT CACACCCCTA 
CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG 
AATATCTCAA GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG 
TGGGGAATCC AGAAACAGAA GCAACCCCAG TCAACAATGA AGGAACACAG 
CATATGTCAG GGCCAAATGA ATTATCAACA CCCTCTTCCG ACCGGGATGC 
ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA TTTGGTAGTA 
GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 
GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC 
AACAGTGAGG CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG 
TAACTTCCCA GTCACCTCGT CACTGGACCA ACAAACCGGA AATAACTACA 
TATCCTTCTG GGGCTTTGCC AGAGAACAAA CAGTTTACAA CTCCAAGATT 
ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA CCCAGCATTC 
CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC 
TCCCAGTCCA AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA 
CCAACAAGAC TCTTTCTTTT CCACAGTTGG GAGTCACCCG GAGACCCCAG 
ATACCCACTT CTCCTGCCCC AGTAATGAGA GAGAGAAAAG TTATTCCAGG 
TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG GACTTTGGCC 
CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC 
CTTTATAAC A TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT 
CAAAGTTCTT TGCAGGAGGA CCTCCTGCAT CCAAATTCTG GTCTCTTGGG 
GAAAAGCCCC AAATCCTCAC CAAGTCCCCA CAGACTGTGT CCGTCACCGC 
TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA CCAAAGCCTT 
TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA 
GGTTCAAGTA CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC 
ACGGCCTGGA CAGGATGGTG GTCTTGCTTT CGGTCACCGT GCAGCAACCT 
CAAATCCTAG CCTCCCACTA CCAGGACGTC ACTGTCTACC TGGGAGACAC 
CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC CAAATTTCCT 
GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC 
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CGTGGAGAGC CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG 
AGGCGTCCTT CTCAGACAGA GGCGTCTATA AGTGCGTGGC CAGCAATGCA 
GCCGGGGCGG ACAGCCTGGC CATCCGCCTG CACGTGGCGG CACTGCCCCC 
CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC CCGGGGCTCA 
GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG 
GAACTTGTTT GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC 
CCAAGGACAG CGGGCGCTAT GAGTGCGTGG CCGCCAACCT GGTAGGCTCC 
GCGCGCAGGA CGGTGCAGCT GAACGTGCAG CGTGCAGCAG CCAACGCGCG 
CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC GGAGGAACCC 
TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG 
AATCAAGGTG TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA 
AAGATGCCGG AGATTACCTG TGCGTAGCTC GAAATAAGGT TGGTGATGAC 
TACGTGGTGC TCAAAGTGGA TGTGGTGATG AAACCGGCCA AGATTGAACA 
CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC CTGAAAGTGG 
ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 
GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG 
CACCAAGCGC TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG 
TGGGGATGAG GGAGGAAGGA GACTACACCT GCTTTGCTGA AAATCAGGTC 
GGGAAGGACG AGATGAGAGT CAGAGTCAAG GTGGTGACAG CGCCCGCCAC 
CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT GGAGACGTGG 
TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT 
ATACCAAGAT GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG 
GCAACTACAC CTGCCTGGTC AGGAACAGCG CGGGAGAGGA TAGGAAGACG 
GTGTGGATTC ACGTCAACGT CCAGCCACCC AAGATCAACG GTAACCCCAA 
CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT CGGAAACTGA 
TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT 
CCATGGCAAC GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG 
TCCAGCTGGT ATGCATGGCA CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
GTGCAGCTCA CTGTCCTGGA GCCCATGGAG AAACCCATCT TCCACGACCC 
GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC AGCCTCAACT 
GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 
GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC 
TGACGGCATG CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT 
ACCGCTGCGT GGCCCGCAAT GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
CTGAAGGTGG GACTGAAGCC AGAAGCAAAC AAGCAGTATC ATAACCTGGT 
CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC CCTCCCGGGG 
CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 
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GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC 
GGTTCGTGAG GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG 
AGACGGAGTA CGGCCCTTCG GTCACCAGCA TCCCCGTGAT TGTGATCGCC 
TATCCTCCCC GGATCACCAG CGAGCCCACC CCGGTCATCT ACACCCGGCC 
CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT CCCAAAGCTG 
ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT 
CCAGCATGCC ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA 
ACATTCTCGG CAGTGACTCC AAAACAACTT ACATCCACGT CTTCTGAAAT 
GTGGATTCCA GAATGATTGC TTAGGAACTG ACAACAAAGC GGGGTTTGTA 
AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT GTCACAGTGC 
ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 
GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT 
GAGACACTTT CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG 
TCTGTGCTCT GACTGCAATT TTTCTTCTTT TGCAAATGCC ACTCGACTGC 
CTTCATAAGC GTCCATAGGA TATCTGAGGA ACATTCATCA AAAATAAGCC 
ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA TCACCTAGTT 
AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG 
GATTTAGAAC CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT 
TACATACATA CAGCTACCAT TTTATATGAA AAAAGAAAAA CATTTCTTCC 
TGGAACTCAC TTTTTATATA ATGTTTTATA TATATATTTT TTCCTTTCAA 
ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT ATTAAAATTA 
ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 
ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC 
TGTTATATTA CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT 
AGATTTCCTT GTATGCAAAG TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
GAGAGGAGAG GAAGGAGAAA ACTGCATCAT AACTTTACAG AATTGAATCT 
AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT CTGGCTTGTC 
CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 
CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT 
TAAGGTCAAT ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 
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Figure 5r 
Human OCP; nucleotide sequence of ORF 

ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGC 

CTGGTCGCCACCCCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCT 

ACGGAGGTACACTGCACATTTCGGTACCTGACTTCCATCCCAGACAGCATCCCGCCC 

AATGTGGAACGCATCAATTTAGGATACAACAGCTTGGTTAGATTGATGGAAACAGAT 

TTTTCTGGCCTGACCAAACTGGAGTTACTCATGCTTCACAGCAATGGCATTCACACA 

ATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTAT 

AATAAAGTCCGAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGA 

TTGCACATGGACCACAACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTC 

AACTTTCTCCGCCTGGTGCACTTGGAAGGAAATCAGCTCACTAAGCTCCACCCAGAT 

ACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATCTCTTTCATTAAGTTCCTA 

TACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTATATGCCT 

GACCTAGACAGCCTTTACCTGCATGGAAACCCATGGACCTGTGATTGC.CATTTAAAG 

TGGTTGTCTGACTGGATACAGGAGAAGCCAGATGTAATAAAATGCAAAAAAGATAGA 

AGTCCCTCTAGTGCTCAGCAGTGTCCACTTTGCATGAACCCTAGGACTTCTAAAGGC 

AAGCCGTTAGCTATGGTCTCAGCTGCAGCTTTCCAGTGTGCCAAGCCAACCATTGAC 

TCATCCCTGAAATCAAAGAGCCTGACTATTCTGGAAGACAGTAGTTCTGCTTTCATC 

TCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGATCAG 

TCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCC 

ATTGCATTCACTGAAGAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTT 

TTGGTGTGCAACATAGATTACGGTCACATTCAGCCAGTGTGGCAAATTTTGGCTTTG 

TACAGTGATTCTCCTCTGATACTAGAAAGGAGCCACTTGCTTAGTGAAACACCGCAG 

CTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATTTTTACCAACATA 

GAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAG 

CTGAACAGAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCT 

CAAATCACTTTACCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATT 

TCAAGGGATAACAATACTAAGCTGGAACATACTGTCTTGGTAGGTGGAACCGTTGGC 

CTGAACTGCCCAGGCCAAGGAGACCCCACCCCACACGTGGATTGGCTTCTAGCTGAT 

GGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTAATAGACAAA 

AGTGGAAAATTGGAACTCCAGATGGCTGATAGTTTTGACACAGGCGTATATCACTGT 

ATAAGCAGCAATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAA 

CCTT TGGT CGAAGCCT AT CAGGAAAATGGGATTCATCAC AC AGTT TT CATTGGT GAA 

ACACTTGATCTTCCATGCCATTCTACTGGTATCCCAGATGCCTCTATTAGCTGGGTT 

ATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAAAGTTCTAAACAAT 

GGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGCTGTGTG 

GCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAA 

GGACAAAGGCCCTTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCC 

AATCCTATTGCTCATCTTAAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTG 

ATGGAGGCTGAGGTTGGAAAACACACCTCAAGCACAAGTAAGAGGCACAACTATCGG 

GAATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGTTTTAGGGAGAATAGG 

AGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTGTTG 

GAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGAAAATACCACAGTGAGCCCA 

CCCCCAGTGGTCACCCAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTCAGGC 

ATGCTCGCTCTACATGAGGAATTTATGGTCCCGGCCACTAAAGCTTTGAACCTTCCA 

GCAAGGACAGTGACTGCTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATA 

AATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAA 

CCCACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAATATA 



AACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCMCATTCATCCACTGTC 

TTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGA 

AGAGAGCATTTCCAAAGTAGACCCCCAATAACAGTAAGGACTATGATCAAAGATGTC 

AATGTCAAAATGCTTAGTAGCACCACCAACAAACTATTATTAGAGTCAGTAAATACC 

ACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTC 

TATTCTCACACTACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACA 

GCTGCTCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTTC 

AGACGCTTTGGGAGGCAGAGGAAAATTGGCGGAAGGGGGCGGATTATCAGCCCATAT 

AGAACTCCAGTTCTGCGACGGCATAGATACAGCATTTTCAGGTCAACAACCAGAGGT 

TCTTCTGAAAAAAGCACTACTGCATTCTCAGCCACAGTGCTCAATGTGACATGTCTG 

TCCTGTCTTCCCAGGGAGAGGCTCACCACTGCCACAGCAGCATTGTCTTTTCCAAGT 

GCTGCTCCCATCACCTTCCCCAAAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCT 

ACAACTCTAGTCCAGAATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAA 

ACAACACCCACAATAAAATATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGT 

GCAGTCATGACATATGCTCCAACATCCATACCCATGGAAAAAACTCACAAAGTAAAC 

GCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGATTCAGTGATTACA 

TCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGACTATTATAGCCATTACAAGG 

TTTTCAAGAAGGAAAATTCCCTGGCAACAGAACTTTGTAAATAACCATAACCCAAAA 

GGCAGATTAAGGAATCAACATAAAGTTAGTTTACAAAAAAGCACAGCTGTGATGCTT 

CCTAAAACATCTCCTGCTTTACCACAGAGACAAAGTTCCCCTTTCCATTTCACCACA 

CTTTCAACAAGTGTGATGCAAATTCCATCTAATACCTTGACTACCGCTCACCACACT 

ACGACCAAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCCTTCCCA 

CCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAAAAAGCATC 

ATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTCCCTGCATCTGTCATC 

ACTTATGAAACCCAAACAGAGAGATCTAGAGCACAAACAATACAAAGAGAACAGGAG 

CCTCAAAAGAAGAACAGGACTGACCCAAACATCTCTCCAGACCAGAGTTCTGGCTTC 

ACTACACCCACTGCTATGACACCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAA 

AACACAACTGGGATTTCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACA 

GATGTGATTGAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCT 

TCTGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAGCATCA 

TTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAATGCCAGTTC.ee 

ATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACACGTGGCGACTCCCATTTC 

CGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCCTCAAGGCACAATCTCCAA 

ATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTACATTCT 

ACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCT 

CCCTGGGCAGAATACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGC 

AAAAAGCCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCTT 

GTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATAAGAAACCA 

GTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTTTGTCTAGGTATATA 

TTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCTAAC 

TCAGATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCCCACCATTCATTGG 

ACCAGAGTTTCAGGACTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTT 

CTCCCCAATGGTACCCTGTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTAC 

TTGTGTTCCGCATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTG 

GTTTCCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACAGTTCATTCC 

GGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACC 

TGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCT 

GTGGTGACGGTTGACGGAACATTGGTCCTCCACAATCTCAGTATTTATGACCGTGGC 

TTTTACAAATGTGTGGCCAGCAACCCAGGTGGCCAGGATTCACTGCTGGTTAAAATA 

CAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAAAGGAGGCAAGTCATTGTAGGC 

ACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGC 



GTTTACTGGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCC 

AAGTTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTTCAGAC 

AGGGGCACTTATGAATGCATTGCTACCAGTTCCACTGGTTCGGAGCGAAGAGTAGTA 

ATGCTTACAATGGAAGAGCGAGTGACCAGCCCCAGGATAGAAGCTGCATCCCAGAAA 

AGGACTGAAGTGAATTTTGGGGACAAATTACTACTGAACTGCTCAGCCACTGGGGAG 

CCCAAACCCCAAATAATGTGGAGGTTACCATCCAAGGCTGTGGTCGACCAGTGGAGC 

TGGATCCACGTCTACCCTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGAC 

AGTGGTGTCTACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATG 

CATGTTAGCCTAAGACTGAAACCTGCCAAAATTGACCACAAGCAGTATTTTAGAAAG 

CAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGCAAAGCTTCCGGCTCCCCAGTG 

CCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAACAATGCAATGCAAGCC 

GATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAACAATGGAACTTTATAC 

TTCAACAAAGTTGGGGTAGCGGAGGAAGGAGATTATACTTGCTATGCCCAGAACACC 

CTAGGGAAAGATGAAATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATA 

AGGCAGAGTAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGAC 

TGTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCCAATGAC 

ATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGGTCTTTGACCATC 

AACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGT 

GGGGATGACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAAT 

GGTCTGTATACAAACAGAACTGTTATTAAAGCCACAGCTGTGAGACATTCCAAAAAA 

CACTTTGACTGCAGAGCTGAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCA 

GACAATATTTTCCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAAT 

GGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTGTG 

GCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATG 

CTGAGAAGACCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGA 

AAGTCCACAGCATTGAATTGCTCTGTTGATGGTAACCCACCACCTGAAATAATCTGG 

ATTTTACCAAATGGCACACGATTTTCCAATGGACCACAAAGTTATCAGTATCTGATA 

GCAAGCAATGGTTCTTTTATCATTTCTAAAACAACTCGGGAGGATGCAGGAAAATAT 

CGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATT 

GGCCAGAAGCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGA 

GAATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGG 

ACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGGAAATACATATTG 

CATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGCTTATGACAGAGGAAACTAT 

ATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACTGTTCCAGTAATGATT 

GTAGCCTACCCTCCCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGGACA 

GGGGCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACA 

TGGGAGATGCCTGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGA 

AGTGAGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCCGAT 

TCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTATGCAGCAACG 

TATATTCAAGTAATCTGA 
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Human OCP: predicted amino acid sequence 

MKVKGRGITC LLVSFAVICL VATPGGKACP RRCACYMPTE VHCTFRYLTS 
IPDSIPPNVE RINLGYNSLV RLMETDFSGL TKLELLMLHS NGIHTIPDKT 
FSDLQALQVL KMSYNKVRKL QKDTFYGLRS LTRLHMDHNN IEFINPEVFY 
GLNFLRLVHL EGNQLTKLHP DTFVSLSYLQ IFKISFIKFL YLSDNFLTSL 
PQEMVSYMPD LDSLYLHGNP WTCDCHLKWL SDWIQEKPDV IKCKKDRSPS 
SAQQCPLCMN PRTSKGKPLA MVSAAAFQCA KPTIDSSLKS KSLTILEDSS 
SAFISPQGFM APFGSLTLNM TDQSGNEANM VCSIQKPSRT SPIAFTEEND 
YIVLNTSFST FLVCNIDYGH IQPVWQILAL YSDSPLILER SHLLSETPQL 
YYKYKQVAPK PEDIFTNIEA DLRADPSWLM QDQISLQLNR TATTFSTLQI 
QYSSDAQITL PRAEMRPVKH KWTMISRDNN TKLEHTVLVG GTVGLNCPGQ 
GDPTPHVDWL LADGSKVRAP YVSEDGRILI DKSGKLELQM ADSFDTGVYH 
CISSNYDDAD ILTYRITVVE PLVEAYQENG IHHTVFIGET LDLPCHSTGI 
PDASISWVIP GNNVLYQSSR DKKVLNNGTL RILQVTPKDQ GYYRCVAANP 
SGVDFLIFQV SVKMKGQRPL EHDGETEGSG LDESNPIAHL KEPPGAQLRT 
SALMEAEVGK HTSSTSKRHN YRELTLQRRG DSTHRRFREN RRHFPPSARR 
IDPQHWAALL EKAKKNAMPD KRENTTVSPP PVVTQLPNIP GEEDDSSGML 
ALHEEFMVPA TKALNLPART VTADSRTISD SPMTNINYGT EFSPVVNSQI 
LPPEEPTDFK LSTAIKTTAM SKNINPTMSS QIQGTTNQHS STVFPLLLGA 
TEFQDSDQMG RGREHFQSRP PITVRTMIKD VNVKMLSSTT NKLLLESVNT 
TNSHQTSVRE VSEPRHNHFY SHTTQILSTS TFPSDPHTAA HSQFPIPRNS 
TVNIPLFRRF GRQRKIGGRG RIISPYRTPV LRRHRYSIFR STTRGSSEKS 
TTAFSATVLN VTCLSCLPRE RLTTATAALS FPSAAPITFP KADIARVPSE 
ESTTLVQNPL LLLENKPSVE KTTPTIKYFR TEISQVTPTG AVMTYAPTSI 
PMEKTHKVNA SYPRVSSTNE AKRDSVITSS LSGAITKPPM TIIAITRFSR 
RKIPWQQNFV NNHNPKGRLR NQHKVSLQKS TAVMLPKTSP ALPQRQSSPF 
HFTTLSTSVM QIPSNTLTTA HHTTTKTHNP GSLPTKKELP FPPLNPMLPS 
IISKDSSTKS IISTQTAIPA TTPTFPASVI TYETQTERSR AQTIQREQEP 
QKKNRTDPNI SPDQSSGFTT PTAMTPPALA FTHSPPENTT GISSTISFHS 
RTLNLTDVIE ELAQASTQTL KSTIASETTL SSKSHQSTTT RKASLDTPIP 
PFLSSSATLM PVPISPPFTQ RAVTDTRGDS HFRLMTNTVV KLHESSRHNL 
QMPSSQLEPL TSSTSNLLHS TPMPALTTVK SQNSKLTPSP WAEYQFWHKP 
YSDIAEKGKK PEVSMLATTG LSEATTLVSD WDGQKNTKKS DFDKKPVQEA 
TTSKLLPFDS LSRYIFEKPR IVGGKAASFT IPANSDAFLP CEAVGNPLPT 
IHWTRVSGLD LSRGNQNSRV QVLPNGTLSI QRVEIQDRGQ YLCSASNLFG 
TDHLHVTLSV VSYPPRILER RTKEITVHSG STVELKCRAE GRPSPTVTWI 
LANQTVVSES SQGSRQAVVT VDGTLVLHNL SIYDRGFYKC VASNPGGQDS 
LLVKIQVIAA PPVILEQRRQ VIVGTWGESL KLPCTAKGTP QPSVYWVLSD' 
GTEVKPLQFT NSKLFLFSNG TLYIRNLASS DRGTYECIAT SSTGSERRVV 
MLTMEERVTS PRIEAASQKR TEVNFGDKLL LNCSATGEPK PQIMWRLPSK 
AVVDQWSWIH VYPNGSLFIG SVTEKDSGVY LCVARNKMGD DLILMHVSLR 
LKPAKIDHKQ YFRKQVLHGK DFQVDCKASG SPVPEISWSL PDGTMINNAM 
QADDSGHRTR RYTLFNNGTL YFNKVGVAEE GDYTCYAQNT LGKDEMKVHL 
TVITAAPRIR QSNKTNKRIK AGDTAVLDCE VTGDPKPKIF WLLPSNDMIS 
FSIDRYTFHA NGSLTINKVK LLDSGEYVCV ARNPSGDDTK MYKLDVVSKP 
PLINGLYTNR TVIKATAVRH SKKHFDCRAE GTPSPEVMWI MPDNIFLTAP 
YYGSRITVHK NGTLEIRNVR LSDSADFICV ARNEGGESVL VVQLEVLEML 
RRPTFRNPFN EKIVAQLGKS TALNCSVDGN PPPEIIWILP NGTRFSNGPQ 
SYQYLIASNG SFIISKTTRE DAGKYRCAAR NKVGYIEKLV ILEIGQKPVI 



LTYAPGTVKG ISGESLSLHC VSDGIPKPNI KWTMPSGYVV DRPQINGKYI 
LHDNGTLVIK EATAYDRGNY ICKAQNSVGH TLITVPVMIV AYPPRITNRP 
PRSIVTRTGA AFQLHCVALG VPKPEITWEM PDHSLLSTAS KERTHGSEQL 
HLQGTLVIQN PQTSDSGIYK CTAKNPLGSD YAATYIQVI* 



